Cora

Citation Author(s):
Andrew
McCallum
Submitted by:
Sepideh Neshatfar
Last updated:
Mon, 03/11/2024 - 17:55
DOI:
10.21227/jsg4-wp31
Data Format:
License:
25 Views
Categories:
Keywords:
0
0 ratings - Please login to submit your rating.

Abstract 

The Cora dataset consists of 2708 scientific publications classified into one of seven classes. The citation network consists of 5429 links. Each publication in the dataset is described by a 0/1-valued word vector indicating the absence/presence of the corresponding word from the dictionary. The dictionary consists of 1433 unique words.

Instructions: 

The Cora dataset consists of 2708 scientific publications classified into one of seven classes. The citation network consists of 5429 links. Each publication in the dataset is described by a 0/1-valued word vector indicating the absence/presence of the corresponding word from the dictionary. The dictionary consists of 1433 unique words.

 

Comments

Some built-in Python packages also provide this dataset.

Submitted by Sepideh Neshatfar on Mon, 03/11/2024 - 17:56

Dataset Files

    Files have not been uploaded for this dataset