SCAF - Semantic Change Analysis with Frequency

This repository contains code for semantic change analysis of words that combines word embeddings with word frequencies. For results on a synthetic corpus, the Google Books Ngram Corpus, and Twitter data, see:

Adrian Englhardt, Jens Willkomm, Martin Schäler and Klemens Böhm, "Improving Semantic Change Analysis by Combining Word Embeddings and Word Frequencies", International Journal on Digital Libraries (IJDL), 19 Mar 2019.

To download the generated word embeddings and results, see the companion website.

The code is licensed under an MIT License and the data set under a Creative Commons Attribution 4.0 International License. If you use this code or data set in your scientific work, please cite the companion paper.

Installing

To install the package, run the following commands:

pip install -r requirements.txt
python setup.py install

Run python setup.py test to run the tests, or tox to run the tests against all supported Python versions (2.7, 3.5 and 3.6).

Overview

This repository contains the following elements to perform semantic change analysis:

For a full example, from training word embeddings through to change detection, see the example notebook in example/example.ipynb.
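To give a rough feel for the general idea of looking at embedding change and frequency change side by side, here is a minimal sketch. It does not use this package's API and is not the method from the companion paper. The helper names (cosine_distance, log_frequency_ratio) and the toy vectors and counts are made up for illustration, and the embeddings are assumed to be already aligned across time slices.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def log_frequency_ratio(count_old, total_old, count_new, total_new):
    """Return the log ratio of a word's relative frequency (new vs. old slice)."""
    rel_old = float(count_old) / total_old
    rel_new = float(count_new) / total_new
    return math.log(rel_new / rel_old)


# Hypothetical toy inputs: one word observed in two time slices.
vec_old = [1.0, 0.0]  # embedding in the earlier slice (assumed aligned)
vec_new = [0.0, 1.0]  # embedding in the later slice

# How far the word moved in embedding space between the two slices.
embedding_change = cosine_distance(vec_old, vec_new)

# How much the word's relative frequency changed between the two slices.
frequency_change = log_frequency_ratio(10, 1000, 20, 1000)

print(embedding_change, frequency_change)
```

Reporting the two signals together lets a large embedding shift be read in light of how the word's usage frequency changed over the same period. How the package actually combines them is described in the companion paper and the example notebook.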

Author

For questions and comments, please contact Adrian Englhardt.