Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WikiConv Chinese Dataset #146

Open
thomaspzollo opened this issue Feb 21, 2022 · 1 comment
Open

WikiConv Chinese Dataset #146

thomaspzollo opened this issue Feb 21, 2022 · 1 comment
Labels
good first issue ideal for first-time contributors

Comments

@thomaspzollo
Copy link

I see the original WikiConv paper says there were conversations in Chinese collected, are these available through ConvoKit?

@cristiandnm
Copy link
Contributor

The full Chinese section of the WikiConv corpus is not yet available in ConvoKit.

We have however released a small sample; see section 1.2 of this example notebook: https://github.com/CornellNLP/Cornell-Conversational-Analysis-Toolkit/blob/master/examples/politeness-strategies/Politeness_Strategies_in_MT-mediated_Communication.ipynb

If you need the full corpus and want to add it yourself, that would be of course appreciated; see data contribution guidelines here:
https://github.com/CornellNLP/Cornell-Conversational-Analysis-Toolkit/blob/master/CONTRIBUTING.md

@cristiandnm cristiandnm added help wanted good first issue ideal for first-time contributors and removed help wanted labels Mar 1, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue ideal for first-time contributors
Projects
None yet
Development

No branches or pull requests

3 participants