validation approach. In this paper, we introduced a domain specific approach to
validate schema.org annotations. Our approach allows domain experts to spec-
ify a domain based on a subset of schema.org vocabulary as well as validation
rules for semantic consistency. We showed the web based implementation of our
approach alongside a use case in the tourism area.
For the future work we will follow the works of different groups, especially the
RDF Data Shapes Working Group, to find out possible alignments between our
approaches. For instance, development in the SHACL shows promising results
and can be utilized for the later implementation of our approach.
Moreover, we are in the processes of advancing the tool that implements our
approach while including the development of more sophisticated rule designer
and validator. We will test our tool in a larger scale in tourism domain within
the next months.
Our approach currently does not consider multi-typed entities, which are
encouraged by the schema.org initiative. For instance, the schema.org hotel ex-
tension [4] suggests that a lodging business should define their rooms as both
schema.org/Room and schema.org/Product in order to conform schema.org spec-
ifications. We will investigate how we can adopt the multi-typed entity notion
in the future work.
References
1. F¨urber, C., Hepp, M.: Using SPARQL and SPIN for Data Quality Management
on the Semantic Web. In: Abramowicz, W., Tolksdorf, R. (eds.) Business Infor-
mation Systems: 13th International Conference, BIS 2010, Berlin, Germany, May
3-5, 2010. Proceedings, pp. 35–46. Springer Berlin Heidelberg, Berlin, Heidelberg
(2010), http://dx.doi.org/10.1007/978-3-642-12814-1{_}4
2. Guha, R.V., Brickley, D., Macbeth, S.: Schema.org: Evolution of structured data
on the web. Commun. ACM 59(2), 44–51 (Jan 2016), http://doi.acm.org/10.
1145/2844544
3. K¨arle, E., Fensel, A., Toma, I., Fensel, D.: Why Are There More Hotels in Tyrol
than in Austria? Analyzing Schema. org Usage in the Hotel Domain. In: Infor-
mation and Communication Technologies in Tourism 2016, pp. 99–112. Springer
(2016)
4. K¨arle, E., Simsek, U., Akbar, Z., Hepp, M., Fensel, D.: Extending the schema. org
vocabulary for more expressive accommodation annotations. In: Information and
Communication Technologies in Tourism 2017, pp. 31–41. Springer (2017)
5. Khalili, A., Auer, S.: WYSIWYM Authoring of Structured Content Based on
Schema.org, pp. 425–438. Springer Berlin Heidelberg, Berlin, Heidelberg (2013),
http://dx.doi.org/10.1007/978-3-642-41154-0{_}32
6. Knublauch, H., Kontokostas, D.: Shapes Constraint Language (2016), https://
w3c.github.io/data-shapes/shacl/
7. Le Hors, A., Solbrig, H., Prudhommeaux, E.: Rdf validation workshop report,
practical assurances for quality rdf data. Tech. rep., Cambridge, MA, USA (2013),
https://www.w3.org/2012/12/rdf-val/report
8. Meusel, R., Bizer, C., Paulheim, H.: A Web-scale Study of the Adoption and Evo-
lution of the Schema.Org Vocabulary over Time. In: Proceedings of the 5th Inter-
national Conference on Web Intelligence, Mining and Semantics. pp. 15:1—-15:11.