데이터 덤프는 Freebase의 다운로드 가능한 데이터 버전입니다. 이러한 데이터는 Freebase에 저장된 데이터와 이를 구조화하는 스키마의 스냅샷으로, 동일한 CC-BY 라이선스에 따라 제공됩니다. Freebase/Wikidata 매핑은 CC0 라이선스에 따라 제공됩니다.
RDF 덤프를 파싱하는 코드를 직접 작성하는 경우 데이터를 먼저 추출한 다음 압축 해제된 데이터를 처리하는 것보다 GZip 파일에서 직접 읽는 것이 더 효율적인 경우가 많습니다.
<subject> <predicate> <object> .
참고: Freebase에서 객체에는 /m/012rkqx와 같은 MID가 있습니다. RDF에서 이러한 MID는 m.012rkqx가 됩니다. 마찬가지로 /common/topic와 같은 Freebase 스키마는 common.topic로 작성됩니다.
주제는 Freebase 객체의 ID입니다. 주제 및 CVT의 경우 Freebase MID(예: m.012rkqx)일 수 있고, common.topic)을 사용합니다.
술어는 항상 Freebase 속성 또는 RDFS와 같은 표준 RDF 어휘의 속성에 대한 사람이 읽을 수 있는 ID입니다. Freebase 외래 키 네임스페이스는 네임스페이스별로 키를 쉽게 조회할 수 있도록 술어로도 사용됩니다.
객체 필드에는 객체의 Freebase MID 또는 Freebase나 기타 RDF 어휘의 스키마에 대한 사람이 읽을 수 있는 ID가 포함될 수 있습니다. 문자열, 불리언, 숫자 값과 같은 리터럴 값도 포함될 수 있습니다.
주제 설명에는 줄바꿈이 포함되는 경우가 많습니다. 각 트리플이 한 줄에 맞도록 줄바꿈을 '\n'으로 이스케이프했습니다.
Freebase 삭제된 트리플
또한 시간이 지남에 따라 Freebase에서 삭제된 트리플의 덤프도 제공합니다. 이는 2013년 3월까지의 일회성 덤프입니다. 향후 최근에 삭제된 트리플의 정기 업데이트를 제공하는 것을 고려할 수 있지만, 현재로서는 구체적인 시기가 정해져 있지 않으며 이 일회성 덤프만 제공됩니다.
덤프는 .tar.gz 파일 (압축 시 2.1GB, 압축 해제 시 7.7GB)로 배포됩니다. 20개의 파일에 삭제된 트리플이 63,036,271개 포함되어 있습니다. 개별 파일에는 특별한 의미가 없으며 하나의 큰 파일보다 여러 개의 작은 파일을 조작하는 것이 더 쉽습니다.
이 데이터 공개를 가능하게 해 준 Chun How Tan과 John Giannandrea에게 감사드립니다.
데이터 형식은 기본적으로 CSV이지만 한 가지 중요한 주의사항이 있습니다. 객체 필드에는 쉼표를 비롯한 모든 문자 (및 생각할 수 있는 기타 적절한 구분 기호)가 포함될 수 있습니다. 하지만 다른 모든 필드에는 쉼표가 포함되지 않으므로 데이터를 명확하게 파싱할 수 있습니다.
데이터 세트의 열은 다음과 같이 정의됩니다.
creation_timestamp (밀리초 단위의 Unix epoch 시간)
크리에이터
deletion_timestamp (밀리초 단위의 Unix epoch 시간)
삭제자
주제 (MID)
술어 (MID)
객체 (MID/리터럴)
language_code
CSV
1352854086000,/user/mwcl_wikipedia_en,1352855856000,/user/mwcl_wikipedia_en,/m/03r90,/type/object/key,/wikipedia/en/$B816,en
1355171076000,/user/mwcl_musicbrainz,1364258198000,/user/turtlewax_bot,/m/0nncp9z,/music/recording/artist,/m/01vbfm4,en
1176630380000,/user/mwcl_images,1335928144000,/user/gardening_bot,/m/029w57m,/common/image/size,/m/0kly56,en
1292854917000,/user/mwcl_musicbrainz,1364823418001,/user/mbz_pipeline_merge_bot,/m/0fv1vl8,/type/object/type,/common/topic,en
1205530905000,/user/mwcl_images,1336022041000,/user/gardening_bot,/m/01x5scz,/common/licensed_object/license,/m/02x6b,en
1302391361000,/user/content_administrator,1336190973000,/user/gardening_bot,/m/0gkb45y,/type/object/type,/type/content,en
1176728962002,/user/mwcl_images,1335954186000,/user/gardening_bot,/m/08430h,/common/topic/image,/m/02cs147,en
1172002568007,/user/mwcl_chefmoz,1283588560000,/user/delete_bot,/m/01z4c1z,/type/object/name,La Casa Rosa Mexican Restaurant,en
Freebase/Wikidata 매핑
이 데이터는 2013년 10월 28일의 Wikidata 덤프를 기반으로 생성되었으며, 공통 Wikipedia 링크가 2개 이상이고 반대되는 Wikipedia 링크가 없는 링크만 포함합니다. 또한 줄은 공통 Wikipedia 링크 수로 정렬됩니다 (Turtle에서는 실제로 중요하지 않음).
Freebase 데이터 덤프는 Google에서 정기적으로 업데이트하며 어떤 목적으로든 무료로 제공됩니다. 이러한 데이터는 Freebase와 마찬가지로 크리에이티브 커먼즈 저작자 표시 (CC-BY)에 따라 배포되며 사용 시 서비스 약관이 적용됩니다.
Freebase/Wikidata ID 매핑은 CC0에 따라 제공되며 제한 없이 사용할 수 있습니다.
인용
게시물에서 이러한 데이터 덤프를 인용하려면 다음을 사용하세요.
Google, Freebase Data Dumps, https://developers.google.com/freebase/data, <month> <day>, <year>
또는 BibTeX로:
BibTex
@misc{freebase:datadumps,title="Freebase Data Dumps"author="Google",howpublished="\url{https://developers.google.com/freebase/data}",edition="<month> <day>, <year>",year="<year>"}
[null,null,["최종 업데이트: 2025-07-27(UTC)"],[[["\u003cp\u003eThe Freebase API has been shut down, but this page provides access to the last available data dump for research purposes.\u003c/p\u003e\n"],["\u003cp\u003eThe data dumps, including Freebase Triples, Deleted Triples, and Wikidata Mappings, are substantially out of date but are available for download.\u003c/p\u003e\n"],["\u003cp\u003eThe datasets are provided under the CC-BY license, except for the Freebase/Wikidata mappings, which are provided under the CC0 license.\u003c/p\u003e\n"],["\u003cp\u003eThese datasets are not maintained, and Google provides no support regarding their contents or format.\u003c/p\u003e\n"]]],[],null,["# Data Dumps\n\n| The Freebase API has been shut down. Freebase was a data-sharing project that ran from 2007 to 2015. You can read more about it at [Wikipedia](https://en.wikipedia.org/wiki/Freebase_(database)), or the [announcement of its closure](https://web.archive.org/web/20170729170416/https://plus.google.com/109936836907132434202/posts/bu3z2wVqcQc?cfem=1).\n|\n| \u003cbr /\u003e\n|\n| This page provides access to the last available data dump. The historical data dumps below are\n| substantially out of date, and may contain information that is not now (or never was) accurate.\n| We continue to host these files in acknowledgement that they have been cited in\n| [numerous academic and scientific studies](https://scholar.google.com/scholar?as_sdt=0%2C5&q=freebase+rdf&btnG=),\n| and in the hope that they may contribute to ongoing efforts around open data and graph databases.\n| These datasets aren't maintained and no support can be provided regarding their\n| contents, format, or structure.\n\nData Dumps are a downloadable version of the data in Freebase. They constitute a snapshot of the data stored in Freebase and the Schema that structures it, and are provided under the same CC-BY license. The Freebase/Wikidata mappings are provided under the CC0 license.\n\n1. [Freebase Triples](#freebase-rdf-dumps)\n2. [Freebase Deleted Triples](#freebase-deleted-triples)\n3. [Freebase/Wikidata Mappings](#freebase-wikidata-mappings)\n4. [License](#license)\n5. [Citing](#citing)\n\nFreebase Triples\n----------------\n\n|---------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------|----------|\n| This dataset contains every fact currently in Freebase. | - **Total triples:** 1.9 billion - **Updated:** Weekly - **Data Format:** [N-Triples RDF](https://www.w3.org/TR/rdf-testcases/#ntriples) - **License:** [CC-BY](https://creativecommons.org/licenses/by/2.5/) | **22 GB** gzip **250 GB** uncompressed | Download |\n\nThe RDF data is serialized using the N-Triples format, encoded as [UTF-8](https://en.wikipedia.org/wiki/Utf8) text and compressed with Gzip.\nRDF \n\n```\n\u003chttp://rdf.freebase.com/ns/g.11vjz1ynm\u003e \u003chttp://rdf.freebase.com/ns/measurement_unit.dated_percentage.date\u003e \"2001-02\"^^\u003chttp://www.w3.org/2001/XMLSchema#gYearMonth\u003e .\n\u003chttp://rdf.freebase.com/ns/g.11vjz1ynm\u003e \u003chttp://rdf.freebase.com/ns/measurement_unit.dated_percentage.source\u003e \u003chttp://rdf.freebase.com/ns/g.11x1gf2m6\u003e .\n\u003chttp://rdf.freebase.com/ns/g.11vjz1ynm\u003e \u003chttp://rdf.freebase.com/ns/type.object.type\u003e \u003chttp://rdf.freebase.com/ns/measurement_unit.dated_percentage\u003e .\n\u003chttp://rdf.freebase.com/ns/g.11vjz1ynm\u003e \u003chttp://rdf.freebase.com/ns/measurement_unit.dated_percentage.rate\u003e 4.5 .\n\u003chttp://rdf.freebase.com/ns/g.11vjz1ynm\u003e \u003chttp://www.w3.org/1999/02/22-rdf-syntax-ns#type\u003e \u003chttp://rdf.freebase.com/ns/measurement_unit.dated_percentage\u003e .\n```\n\nIf you're writing your own code to parse the RDF dumps its often more efficient to read directly from GZip file rather than extracting the data first and then processing the uncompressed data. \n\n \u003csubject\u003e \u003cpredicate\u003e \u003cobject\u003e .\n\nNote: In Freebase, objects have MIDs that look like `/m/012rkqx`. In RDF those MIDs become m.012rkqx. Likewise, Freebase schema like `/common/topic` are written as `common.topic`.\n\nThe *subject* is the ID of a Freebase object. It can be a Freebase MID (ex. m.012rkqx) for topics and CVTs or a human-readable ID (ex. `common.topic`) for schema.\n\nThe *predicate* is always a human-readable ID for a Freebase property or a property from a standard RDF vocabulary like [RDFS](https://en.wikipedia.org/wiki/RDFS). Freebase foreign key namespaces are also used as predicates to make it easier to look up keys by namespace.\n\nThe object field may contain a Freebase MID for an object or a human-readable ID for schema from Freebase or other RDF vocabularies. It may also include literal values like strings, booleans and numeric values.\n\nTopic descriptions often contain newlines. In order to make each triple fit on one line, we have escaped newlines with \"\\\\n\".\n\nFreebase Deleted Triples\n------------------------\n\nWe also provide a dump of triples that have been deleted from Freebase over time. This is a one-time dump through March 2013. In the future, we might consider providing periodic updates of recently deleted triples, but at the moment we have no specific timeframe for doing so, and are only providing this one-time dump.\n\nThe dump is distributed as a .tar.gz file (2.1Gb compressed, 7.7Gb uncompressed). It contains 63,036,271 deleted triples in 20 files (there is no particular meaning to the individual files, it is just easier to manipulate several smaller files than one huge file).\n\n\nThanks to Chun How Tan and John Giannandrea for making this data release possible.\n\n|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------|----------|\n| - **Total triples:** 63 million - **Updated:** June 9, 2013 - **Data Format:** [CSV](https://en.wikipedia.org/wiki/Comma-separated_values) - **License:** [CC-BY](https://creativecommons.org/licenses/by/2.5/) | **2 GB** gzip **8 GB** uncompressed | Download |\n\nThe data format is essentially CSV with one important caveat. The object field may contain any characters, including commas (as well as any other reasonable delimiters you could think of). However, all the other fields are guaranteed not to contain commas, so the data can still be parsed unambiguously.\n\nThe columns in the dataset are defined as:\n\n- creation_timestamp (Unix epoch time in milliseconds)\n- creator\n- deletion_timestamp (Unix epoch time in milliseconds)\n- deletor\n- subject (MID)\n- predicate (MID)\n- object (MID/Literal)\n- language_code\n\nCSV \n\n```\n1352854086000,/user/mwcl_wikipedia_en,1352855856000,/user/mwcl_wikipedia_en,/m/03r90,/type/object/key,/wikipedia/en/$B816,en\n1355171076000,/user/mwcl_musicbrainz,1364258198000,/user/turtlewax_bot,/m/0nncp9z,/music/recording/artist,/m/01vbfm4,en\n1176630380000,/user/mwcl_images,1335928144000,/user/gardening_bot,/m/029w57m,/common/image/size,/m/0kly56,en\n1292854917000,/user/mwcl_musicbrainz,1364823418001,/user/mbz_pipeline_merge_bot,/m/0fv1vl8,/type/object/type,/common/topic,en\n1205530905000,/user/mwcl_images,1336022041000,/user/gardening_bot,/m/01x5scz,/common/licensed_object/license,/m/02x6b,en\n1302391361000,/user/content_administrator,1336190973000,/user/gardening_bot,/m/0gkb45y,/type/object/type,/type/content,en\n1176728962002,/user/mwcl_images,1335954186000,/user/gardening_bot,/m/08430h,/common/topic/image,/m/02cs147,en\n1172002568007,/user/mwcl_chefmoz,1283588560000,/user/delete_bot,/m/01z4c1z,/type/object/name,La Casa Rosa Mexican Restaurant,en\n```\n\nFreebase/Wikidata Mappings\n--------------------------\n\n|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|--------------------------------------------|----------|\n| The data has been created based on the Wikidata-Dump of October 28, 2013, and contains only those links that have at least two common Wikipedia-Links and not a single disagreeing Wikipedia-Link. Furthermore, the lines are sorted by the number of common Wikipedia-Links (although in Turtle this does not really matter). | - **Total triples:** 2.1M - **Updated:** October 28, 2013 - **Data Format:** [N-Triples RDF](https://www.w3.org/TR/rdf-testcases/#ntriples) - **License:** [CC0](https://creativecommons.org/publicdomain/zero/1.0/) | **21.2 MB** gzip **242.9 MB** uncompressed | Download |\n\nThe RDF data is serialized using the N-Triples format, encoded as [UTF-8](https://en.wikipedia.org/wiki/Utf8) text and compressed with Gzip.\nRDF \n\n```\n\u003chttp://rdf.freebase.com/ns/m.0695j\u003e \u003chttp://www.w3.org/2002/07/owl#sameAs\u003e \u003chttp://www.wikidata.org/entity/Q6718\u003e .\n\u003chttp://rdf.freebase.com/ns/m.05nrg\u003e \u003chttp://www.w3.org/2002/07/owl#sameAs7\u003e \u003chttp://www.wikidata.org/entity/Q538\u003e .\n\u003chttp://rdf.freebase.com/ns/m.0jgd\u003e \u003chttp://www.w3.org/2002/07/owl#sameAs\u003e \u003chttp://www.wikidata.org/entity/Q414\u003e .\n\u003chttp://rdf.freebase.com/ns/m.0d_23\u003e \u003chttp://www.w3.org/2002/07/owl#sameAs\u003e \u003chttp://www.wikidata.org/entity/Q2537\u003e .\n\u003chttp://rdf.freebase.com/ns/m.04g7d\u003e \u003chttp://www.w3.org/2002/07/owl#sameAs\u003e \u003chttp://www.wikidata.org/entity/Q315\u003e .\n```\n\nLicense\n-------\n\nFreebase Data Dumps are provided free of charge for any purpose with regular updates by Google. They are distributed, like Freebase itself, under the [Creative Commons Attribution (aka CC-BY)](https://creativecommons.org/licenses/by/2.5/) and use is subject to the [Terms of Service](/freebase/terms).\nThe Freebase/Wikidata ID mappings are provided under [CC0](https://creativecommons.org/publicdomain/zero/1.0/) and can be used without restrictions.\n\nCiting\n------\n\nIf you'd like to cite these data dumps in a publication, you may use: \n\n```\nGoogle, Freebase Data Dumps, https://developers.google.com/freebase/data, \u003cmonth\u003e \u003cday\u003e, \u003cyear\u003e\n```\n\nOr as BibTeX:\nBibTex \n\n```transact-sql\n@misc{freebase:datadumps,\n title = \"Freebase Data Dumps\"\n author = \"Google\",\n howpublished = \"\\url{https://developers.google.com/freebase/data}\",\n edition = \"\u003cmonth\u003e \u003cday\u003e, \u003cyear\u003e\",\n year = \"\u003cyear\u003e\"\n}\n```"]]