registry#
- scikitplot.corpus.registry = ComponentRegistry(chunkers=0, filters=0, readers=0, normalizers=0)[source]#
Central look-up table for corpus pipeline components.
Stores class references (not instances) for four component types: chunkers, filters, readers, and normalizers. Callers retrieve a class and instantiate it with their own parameters.
Notes
The module-level
registrysingleton is pre-populated with all built-in components viaregister_builtins. Third-party packages can register additional components after import.Examples
>>> from scikitplot.corpus._registry import registry >>> registry.register_builtins() >>> cls = registry.get_chunker("paragraph") >>> chunker = cls(min_chars=20)