Datacommons.org

From HandWiki
Short description: Knowledge repository integrating open datasets

Datacommons.org is an open knowledge graph hosted by Google that provides a unified view across multiple public datasets, combining economic, scientific and other open datasets into an integrated data graph.[1] The Datacommons.org site was launched in May 2018 with an initial dataset consisting of fact-checking data published in Schema.org "ClaimReview" format by several fact checkers from the International Fact-Checking Network.Cite error: Closing </ref> missing for <ref> tag its APIs[2] also include tools — such as a Pandas dataframe interface — oriented towards data science, statistics and data visualization.

Datacommons.org is integrative, meaning that, rather than providing a hosting platform for diverse datasets, it attempts to consolidate much of the information the datasets provide into a single data graph.

Technology

Datacommons.org is built on a graph data-model. The graph can be accessed through a browser interface and several APIs,[1][3] and is expanded through loading data (typically CSV and MCF-based templates).[4] The graph can be accessed by natural language queries in Google Search.[5] The data vocabulary used to define the datacommons.org graph is based upon Schema.org.[1] In particular the Schema.org terms StatisticalPopulation[6] and Observation[7] were proposed to Schema.org to support datacommons-like usecases.[8]

Software from the project is available on GitHub under Apache 2 license.[9]

References

External links