Linked Traceable Agriculture Product Data

鏈結產銷履歷農產品資料

Introduction

The recent developments of open data initiative in Taiwan is populating linked data on the Web. The effort to interlink data has opportunity to enhance the shareability, extensibility, and re-usability of open data. As such, linked open data potentially provides benefits in improvement of public service and creation of social and economic value.

A Traceable Agricultural Product (TAP) record can provide a track about an agricultural product from a farm to a store. The TAP records are published on open data portal. However, the content of the TAP records is often inconsistent as well as not machine-readable. The problems make the TAP records difficult to reuse for adding more value. To deal with the problems, there is a need to improve the data quality of TAP records. Linked data paradigm can be a way to promote TAP records to be shareable, reusable and extensible. This paper reports our expensive to transform TAP records from open data to linked open data, and reuse the linked TAP records in an online recipe service for offering users safety agricultural product.

To clarify the concepts in TAP records, a TAP ontology is created on the basis of existing vocabularies such as W3C Time, OGC GeoSPARQL, FOAF, and AGROVOC. To make the content of TAP record consistent, we cleaned the dataset of TAP records to standardize and unify the names of crop, stores (supermarkets), and restaurants. After the data clean process, TAP records are transformed from JSON to RDF, and the RDF triples are populated to three SPARQL endpoints: TAP records, Stores, and Restaurants. With having the three SPARQL endpoints, we developed a Chrome extension for implementing federated SPARQL queries to provide a user where the safety foods come from, where the restaurants provide the safety foods, and what is the price of the foods when the user browse a recipe.


Raw Open Data

原始資料

The raw data is publised on Taiwanese Open Data Portal under the Open Government Data License, version 1.0 which is compatible with the Creative Commons Attribution License 4.0 International.


Data Process

資料處理

Extraction of crop names from production names.

Matching crop names between TAP and wholesale agricultural markets for obtaining crop price information.

Inventory of stores via a web crawler, and analysis of the store informaiton in raw TAP data for mapping a traceable agriculture product to stores.

Crawling restaurants from the website about restaurants providing dishse using TAP.

To obtain longitude and latitude from cadastral number via transformation tool developed by Ronny Wang, a civic hacker.


Ontology for TAP

知識本體

The ontology for TAP data can be distinguished three parts. For location, OGC GeoSPARQL is used to clarify the geospatial information in TAP. In terms of agricultural data, AGROVOC is used to explained the crop and agricultural product. In terms of people and organization, FOAF is used for farmers, stores, certification agencies, and producers. The detail vocabularies of the ontology is on TAP


SPARQL Endpoints

查詢

By using D2R server, three SPARQL endpoints are developed for semantified data retrieve.

Through using RDFlib, a Chrome extension is developed for querying TAP data via the three endpoints on the basis of ingredients of the recipe website (iCook.tw)