opendataval.dataloader.datasets.nlpsets#

NLP data sets.

Uses HuggingFace transformers. as dependency.

Module Attributes

bbc_embedding(cache_dir, force_download, ...)

Classification data set registered as "bbc-embeddings", BERT text embeddings.

imdb_embedding(cache_dir, force_download, ...)

Classification data set registered as "imdb-embeddings", BERT text embeddings.

Functions

BertEmbeddings(func[, batch_size])

Convert text data into pooled embeddings with DistilBERT model.

bbc_embedding(cache_dir, force_download, ...)

Classification data set registered as "bbc-embeddings", BERT text embeddings.

download_bbc(cache_dir, force_download)

Classification data set registered as "bbc".

download_imdb(cache_dir, force_download)

Binary category sentiment analysis data set registered as "imdb".

imdb_embedding(cache_dir, force_download, ...)

Classification data set registered as "imdb-embeddings", BERT text embeddings.