Overview of Annotation Creation: Processes & Tools
Mark A. Finlayson, Toma\v{z} Erjavec

TL;DR
This paper reviews the processes and essential tool capabilities for creating high-quality linguistic annotations, emphasizing the importance of supporting annotation schemes and language-specific features.
Contribution
It provides an overview of annotation creation workflows and identifies key tool capabilities and common issues, focusing on abstract functionalities rather than specific tools.
Findings
Core tool capabilities include support for annotation schemes and language compatibility.
Additional useful capabilities are categorized into widely available, occasionally useful, and underdeveloped features.
Highlights ongoing challenges and areas for tool improvement in linguistic annotation.
Abstract
Creating linguistic annotations requires more than just a reliable annotation scheme. Annotation can be a complex endeavour potentially involving many people, stages, and tools. This chapter outlines the process of creating end-to-end linguistic annotations, identifying specific tasks that researchers often perform. Because tool support is so central to achieving high quality, reusable annotations with low cost, the focus is on identifying capabilities that are necessary or useful for annotation tools, as well as common problems these tools present that reduce their utility. Although examples of specific tools are provided in many cases, this chapter concentrates more on abstract capabilities and problems because new tools appear continuously, while old tools disappear into disuse or disrepair. The two core capabilities tools must have are support for the chosen annotation scheme and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
