Skip to content

englhardt/scaf

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SCAF - Semantic Change Analysis with Frequency

Build Status

This repository contains code for semantic change analysis of words that combines word embeddings with word frequencies. For results on a synthetic corpus, the Google Books Ngram Corpus and on Twitter data see

Adrian Englhardt, Jens Willkomm, Martin Schäler and Klemens Böhm, "Improving Semantic Change Analysis by Combining Word Embeddings and Word Frequencies", International Journal on Digital Libraries (IJDL), 19 Mar 2019.

For a download of the generated word embeddings and results, see the companion website

The code is licensed under a MIT License and the data set under a Creative Commons Attribution 4.0 International License. If you use this code or data set in your scientific work, please reference the companion paper.

Installing

To install the package run the following steps:

pip install -r requirements.txt
python setup.py install

Run python setup.py test to run the tests or tox to run test for all supported python versions (2.7, 3.5 and 3.6).

Overview

This repository contains the following elements to perform semantic change analysis:

For a full example from training word embeddings up to the change detection see the example notebook in example/example.ipynb.

Author

For questions and comments, please contact Adrian Englhardt.

About

Semantic Change Analysis with Frequency

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published