Semanlink

Semanlink dataset.

Parameters

  • batch_size

  • use_labels – defaults to True

  • shuffle – defaults to True

  • pre_compute – defaults to True

  • num_workers – defaults to 1

  • seed – defaults to None
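The sketch below illustrates how batch_size, shuffle and seed could interact when a list of triples is turned into batches. It is not the ckb implementation: `iter_batches` and `toy_triples` are hypothetical names introduced here for illustration, and the sketch relies only on the Python standard library.

```python
import random


def iter_batches(triples, batch_size, shuffle=True, seed=None):
    """Yield lists of at most `batch_size` triples (hypothetical helper, not part of ckb)."""
    order = list(triples)
    if shuffle:
        # A dedicated Random instance keeps the shuffle reproducible for a given seed.
        random.Random(seed).shuffle(order)
    for start in range(0, len(order), batch_size):
        yield order[start:start + batch_size]


# Toy (head, relation, tail) triples, made up for this example.
toy_triples = [(0, 0, 1), (1, 1, 2), (2, 0, 3), (3, 1, 0)]

first = list(iter_batches(toy_triples, batch_size=2, seed=42))
second = list(iter_batches(toy_triples, batch_size=2, seed=42))
unshuffled = list(iter_batches(toy_triples, batch_size=3, shuffle=False))
```

With the same seed, `first` and `second` contain the same batches in the same order. With `shuffle=False`, the triples keep their original order and the last batch may be smaller than batch_size.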

Attributes

  • train (list): Training set.

  • valid (list): Validation set.

  • test (list): Testing set.

  • entities (dict): Index of entities.

  • relations (dict): Index of relations.

  • n_entity (int): Number of entities.

  • n_relation (int): Number of relations.
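As a rough illustration of how the entities and relations indexes relate to n_entity and n_relation, the sketch below builds both indexes from a toy list of triples. The helper `build_index`, the toy triples and the (head, relation, tail) layout are assumptions made for this example, not the way ckb necessarily stores or constructs them.

```python
def build_index(names):
    """Map each distinct name to an integer id, in order of first appearance (hypothetical helper)."""
    index = {}
    for name in names:
        if name not in index:
            index[name] = len(index)
    return index


# Toy (head, relation, tail) triples, made up for this example.
toy_triples = [("a", "r1", "b"), ("b", "r2", "c"), ("a", "r2", "c")]

entities = build_index(e for head, _, tail in toy_triples for e in (head, tail))
relations = build_index(rel for _, rel, _ in toy_triples)

# The counts are simply the sizes of the two indexes.
n_entity = len(entities)
n_relation = len(relations)

# Triples rewritten with integer ids instead of names.
encoded = [(entities[h], relations[r], entities[t]) for h, r, t in toy_triples]
```

Under these assumptions, each distinct entity and relation gets exactly one id, so the counts follow directly from the index sizes.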

Examples

>>> from ckb import datasets

>>> dataset = datasets.Semanlink(batch_size=1, pre_compute=True, shuffle=True, seed=42)

>>> dataset
Semanlink dataset
    Batch size  1
    Entities  5454
    Relations  4
    Shuffle  True
    Train triples  6422
    Validation triples  803
    Test triples  803

Methods

fetch

get_train_loader

Initialize train dataset loader.

Parameters

  • mode

mapping_entities

Construct mapping entities.

mapping_relations

Construct mapping relations.

test_dataset

test_stream

validation_dataset