español (España) English (United States)

In this page, we will include details about available data sources supported by the members of the network. The links to the Spanish open dataset catalogue that is maintained by the network are here: CKAN and DataHub.

Generated by members of the network

GeoLinkedData.es

GeoLinked Data (.es) is an open initiative of the OEG whose aim is to enrich the Web of Data with Spanish geospatial data. This initiative started off by publishing  in the Web of Data diverse information sources belonging to the National Geographic Institute of Spain

The data source used, until now, in GeoLinkedData.es belongs to the pertenecen al Instituto National Geographic Institute of Spain (IGN in Spanish) and the National Statistics Institute of Spain (INE in Spanish):

  • The Numerical Cartographic Database, called BCN200 (scale 1:200,000)
  • The Numerical Cartographic Database, called BTN25, (scale 1:25,000)
  • The National Geographic Gazetteer data source (scale 1:50,000) also called Georreferenced DataBase or NOMGEO
  • The Conciso Gazetteer with its information compiled at a scale 1:1 million
  • The National Atlas data source (scale 1: 1:500,000).
  • EuroGlobalMap (EGM)

SemanticXBRL

SemanticXBRL is a dataset generated by GRIHO from financial reports in XBRL format, which many companies listed on U.S. stock market sent to the SEC. SmenaticXBRL is part of the Linked Open Data Cloud and is available in CKAN.

WebenemasunoLinkeData.es

WebenemasunoLinkedData dataset with the information of the travel guides and blogs of El Viajero: Guías de Viajes de El País. Information available:

  • El Viajero travel guides: in a variant IPTC format which includes all the information about the travel guides and their metadata (creation date, authors, role, date, references). IPTCs also includes references to images and videos.
  • Blogs and posts: posts about the nearly 600 blogs that make up El Viajero, as well as the metadata of each post: description, author, creation date, etc. Comments are also included.
  • UPOS: geolocation source for the guides.

Censo 2001 RDF

Censo 2001 RDF is a DataWeb research work to promote the publication of open governmental data, specifically in Spain, within the framework of transparency and interoperability set up in the Open Data movement. The goal is to prove that the great amount of data generated by the administrations, currently almost useless for third-parties, can be easily converted to open formats, providing great value for these third-parties and the administrations giving them interoperability. This site offers information of the conversion process of a 5% sampling of the 2001 Spanish census from a plain useless format to RDF, a semantic representation supported by different products, services and protocols. In addition to the information of the process, a SPARQL querying interface is provided, as well as a set of visualization examples which make use of the SPARQL Endpoint to show the potential of the open data framework.

DBpedia español (Spanish DBpedia)

The DBpedia project has been generating semantic information from the english wikipedia. From June 2011 the extraction process extracts information from Wikipedia in 15 of their non-english languages. One of these is the Spanish language. The DBpedia internationalization committee has assigned a web site and a SPARQL endpoint to each one of these languages.
The extraction process for the Spanish language has produced so far almost 100 millon RDF triples. In the SPARQL endpoint the most relevant triples (~70 millions) are available. This work depends on researchers from UAM (Mariano Rico), UPM(Oscar Corcho), all of them belonging to the Spanish Thematic Network of Linked Data, as well as individuals who dedicate their time and effort to this initiative. This initiative started on a mapping generation hackathon that was celebrated under the auspices of the aforementioned network.

 

In progress by members of the network

Dublin Core data source for publications.

Deusto will create a Dublin Core data source with all the publications, at least from 2010 to 2011, of the research group MoreLab. It will try to create a processor of links to PDF publications files1, from which to extract the metadata needed to generate instances Dublin Core.

SPARQL endpoint of the MoreLab group

An SPARQL endpoint will be created for the MoreLab group website. The data in RDF format that can be consulted will be modeled according to the Dublin Core, Geo, DOAP and Bio vocabularies. An automatic generator of personal home pages of the group will be created, including sections such as Bio, publications or projects.

AGRIS Project

GRIHO group is working with the project AGRIS of UN-FAO for publishing as Linked Data references that have been collected about publications related to food and agriculture.
 

Gerenerated outside of the network

Open Data initiatives of different governments

API for accessing BizkaiaBus public data

Iker Perez has developed an API for accessing the public data of the bus network BizkaiaBus. More information: http://openbizkaibus.appspot.com/api/

API Infocarretera

An API has been developed for accessing to the public data provided by the Department of Interior of the Basque Government about the roads status. For more information: http://www.infocarretera.com/api/

Mediating action

Our network also wants to act as mediator for the publication of data in Linked Data format. To do this, if you have any interest in finding help for the generation of data in Linked Data format in a specific area, send us an .with the following information and we will post your request and we will try to find groups or people working on it.

  • Your name
  • Your email address
  • Your country
  • Your occupation/affiliation (if relevant)
  • What data would you like?
  • Do you have direct access to the original data sources? In which format?
  • Why do you want this data? What would you do with it if it was opened up?
  • Would you be willing to pay something for the transformation and maintenance?