documents_to_polars#
- scikitplot.corpus.documents_to_polars(docs, *, include_embedding=False)[source]#
Convert a list of
CorpusDocumentinstances to apolars.DataFrame.- Parameters:
- docslist of CorpusDocument
Documents to convert. Must be non-empty.
- include_embeddingbool, optional
When
True, include a column"embedding"with list-of-float values. Default:False.
- Returns:
- polars.DataFrame
One row per document. Metadata fields are promoted to columns.
- Raises:
- ImportError
If
polarsis not installed.- ValueError
If
docsis empty.
- Parameters:
docs (list[CorpusDocument])
include_embedding (bool)
- Return type:
pl.DataFrame
Examples
>>> docs = [CorpusDocument.create("f.txt", i, f"Sentence {i}.") for i in range(3)] >>> df = documents_to_polars(docs) >>> len(df) 3