Skip to content

ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.

License

Notifications You must be signed in to change notification settings

uds-lsv/ATC-Anno

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Latest Version: DOI

ATC-Anno

ATC-Anno is an annotation tool for the transcription and semantic annotation of air traffic control utterances. Its most unusual feature is its prediction of semantic annotations based on a finite-state grammar. As long as a transcribed sentence is moderately close to a sentence in the grammar, the annotation found in the grammar can be applied to the transcription.

Screenshot of ATC-Anno

Tutorial

To demonstrate the workflow and features of the tool, we have created a brief video tutorial. The video has optional subtitles.

Requirements

ATC-Anno is written in Python 2.7 and requires the following additional packages:

Note: The installation on Ubuntu 20.04 or higher is non trivial due to older dependencies. We therefore have a build on a VM, which can be accessed here: https://unisaarlandde-my.sharepoint.com/:f:/g/personal/s9algawl_uni-saarland_de/EomjdHvZ0gtBtrr8-s36C3QBlbV_v42HmGk9exmiRiOWIQ?e=Z7btQP

Finite-state Grammar

To access the Get XML feature, you need to provide a finite-state grammar that uses the OpenFST format. The default location for the grammar is data/grammars/default.fst

Concept Extraction

The conversion of airline names to callsign representations (e.g. "Lufthansa" to "DLH") is based on the airline dictionary found at data/airlines/callsigns.txt. You can expand the file to include all airlines that occur in your grammar.

Attribution

This software is published under an MIT License.

If you use it in your research or work, please cite the following publication:

Schulder, Marc and O'Mahony, Johannah and Bakanouski, Yury and Klakow, Dietrich (2020). "ATC-Anno: Semantic Annotation for Air Traffic Control with Assistive Auto-Annotation". Proceedings of the 12th Conference on Language Resources and Evaluation (LREC), pages 6375–6380, Marseille, France, 13 May 2020.

BibTex

@InProceedings{schulder2020atcAnno,
  author = {Schulder, Marc and O'Mahony, Johannah and Bakanouski, Yury and Klakow, Dietrich},
  title = {{ATC-Anno}: Semantic Annotation for Air Traffic Control with Assistive Auto-Annotation},
  booktitle = {Proceedings of the Twelfth International Conference on Language Resources and Evaluation (LREC 2020)},
  year = {2020},
  month = {May},
  address = {Marseille, France},
  publisher = {European Language Resources Association (ELRA)},
}

About

ATC-Anno is an annotation tool for Air Traffic Control data that offers automatic semantic and concept annotation.

Topics

Resources

License

Stars

Watchers

Forks

Languages