Bayesian Network benchmark Datasets and mixed data

Citation Author(s):
Ruijing Cui
Submitted by:
Ruijing Cui
Last updated:
Wed, 09/06/2023 - 04:09
DOI:
10.21227/k8eh-yn74

Abstract 

This collection contains benchmark Bayesian network datasets based on the seed Bayesian networks from https://www.bnlearn.com. Some of the data come from https://pages.mtu.edu/~lebrown/supplements/mmhc_paper/mmhc_index.html, and other datasets, which contain mixed data, come from the UCI repository. The data can be used to learn the structure of Bayesian networks, to study causality-based feature selection algorithms, and for similar tasks. bnlearn is an R package for learning the graphical structure of Bayesian networks, estimating their parameters, and performing some useful inference. First released in 2007, it has been under continuous development for more than 10 years (and is still going strong). The Bayesian Network Repository contains the networks stored in multiple formats as well as citations to the original papers. Each zip file contains the 10 datasets used for learning at each sample size (500, 1000, 1500, ..., 5000) and a file, Name_graph.txt, that contains the adjacency matrix of the true network.
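As an illustration of how the adjacency matrix in Name_graph.txt might be used, the following minimal Python sketch turns such a matrix into a list of directed edges. It rests on assumptions not stated above: that the file holds a square matrix of whitespace-separated 0/1 entries, one row per line, and that a non-zero entry in row i, column j means an edge from node i to node j. The function names parse_adjacency and edges_from_adjacency are hypothetical helpers, not part of any package mentioned here.

```python
def parse_adjacency(text):
    """Parse a whitespace-separated matrix (one row per line) into a list of lists.

    Assumption: the file contains only integer entries and the matrix is square.
    """
    rows = [[int(tok) for tok in line.split()]
            for line in text.splitlines() if line.strip()]
    n = len(rows)
    if any(len(row) != n for row in rows):
        raise ValueError("adjacency matrix must be square")
    return rows


def edges_from_adjacency(adj):
    """Return directed edges (i, j) for every non-zero entry adj[i][j].

    Assumption: row index = parent node, column index = child node.
    """
    return [(i, j)
            for i, row in enumerate(adj)
            for j, value in enumerate(row)
            if value != 0]


# Small made-up matrix for demonstration; not taken from the dataset.
example = """0 1 1
0 0 1
0 0 0
"""
adj = parse_adjacency(example)
edges = edges_from_adjacency(adj)
print(edges)
```

To apply this to a real file, read its contents first, for example `parse_adjacency(open(path).read())`, where `path` points to one of the Name_graph.txt files. If the files turn out to use a different delimiter or the transposed orientation, the parsing and the edge direction would need to be adjusted accordingly.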

Instructions: 

Detailed sources and information on the data are available at https://www.bnlearn.com, https://pages.mtu.edu/~lebrown/supplements/mmhc_paper/mmhc_index.html, and https://archive.ics.uci.edu/datasets.