Data Annotation

Annotations are an important way in which social machines interact with data. Most systems that have been developed for data annotation assume a two-level structure in which annotations are superimposed on, and separate from, the data itself. However, there are many cases in which annotations are themselves annotated, such as when discussion threads in emails and newsgroups make it possible to attach comments to other people's comments.

The Data Annotation project explores the challenges associated with annotated databases. We have developed hierarchical models of annotation in which there is no absolute distinction between annotation and data, and have explored how annotations propagate through queries in such models. Annotated databases also pose new challenges in query optimisation, because propagating annotations requires us to reconsider many fundamental tasks, such as query containment.
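To make annotation propagation concrete, here is a minimal sketch, assuming the common semiring-style treatment in which each tuple carries an annotation from a semiring, joins multiply annotations, and projections or unions add them. It is not the project's hierarchical model. The names `Semiring`, `join_project`, `COUNTING` and `BOOLEAN`, and the relations `R` and `S`, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, Tuple


@dataclass(frozen=True)
class Semiring:
    """Hypothetical helper: a semiring given by its zero, one, add and mul."""
    zero: Any
    one: Any
    add: Callable[[Any, Any], Any]
    mul: Callable[[Any, Any], Any]


# Two illustrative annotation domains (assumptions, not taken from the project).
COUNTING = Semiring(0, 1, lambda x, y: x + y, lambda x, y: x * y)        # multiplicities
BOOLEAN = Semiring(False, True, lambda x, y: x or y, lambda x, y: x and y)  # plain presence

# An annotated binary relation maps each tuple to its annotation.
Relation = Dict[Tuple[str, str], Any]


def join_project(r: Relation, s: Relation, sr: Semiring) -> Relation:
    """Evaluate the conjunctive query Q(a, c) :- R(a, b), S(b, c).

    Annotations of joined tuples are multiplied; annotations of
    derivations that yield the same output tuple are added.
    """
    out: Relation = {}
    for (a, b), k1 in r.items():
        for (b2, c), k2 in s.items():
            if b == b2:
                key = (a, c)
                out[key] = sr.add(out.get(key, sr.zero), sr.mul(k1, k2))
    return out


# Example data: annotations here are multiplicities.
R = {("x", "p"): 2, ("x", "q"): 1}
S = {("p", "z"): 3, ("q", "z"): 1}

counted = join_project(R, S, COUNTING)

# The same query over the Boolean semiring only records whether a tuple is present.
R_bool = {t: True for t in R}
S_bool = {t: True for t in S}
present = join_project(R_bool, S_bool, BOOLEAN)
```

In this sketch the output tuple `("x", "z")` has two derivations, so under `COUNTING` its annotation is 2·3 + 1·1 = 7, while under `BOOLEAN` it is simply `True`. Whether one query is contained in another can depend on which annotation domain is used, which is why containment has to be reconsidered for annotated databases.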

Publications
Kostylev, E. V., Reutter, J. L., & Salamon, A. Z. (2014). Classification of Annotation Semirings over Containment of Conjunctive Queries. ACM Trans. Database Syst., 39, 1:1–1:39.
Buneman, P., Kostylev, E. V., & Vansummeren, S. (2013). Annotations Are Relative. Proceedings of the 16th International Conference on Database Theory, 177–188.
Kostylev, E. V., Reutter, J. L., & Salamon, A. Z. (2012). Classification of Annotation Semirings over Query Containment. Proceedings of the 31st ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, 237–248.