Chinese corpus no longer available #6

Open
buaasky opened this issue Jul 5, 2018 · 4 comments
Comments

@buaasky

buaasky commented Jul 5, 2018

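Hello, could you tell me what format the Chinese corpus is in? The Baidu Netdisk link can no longer be downloaded.
I get an error when I train on my own corpus. Could you describe the expected format of the Chinese corpus? Thanks!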

@zhezhaoa
Owner

Hi, the download seems to work on my end. Have you word-segmented your corpus? I also recommend UTF-8 encoding.
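
For anyone who only wants a tiny corpus to test with: the thread does not spell out the exact file format, so the following is just a sketch built on the two hints in this reply (word-segmented text, UTF-8). It assumes one sentence per line with tokens separated by single spaces, which is a common convention for word-embedding toolkits but is not confirmed here. The function names and the toy sentences are hypothetical; in practice the tokens would come from whatever word segmenter you use.

```python
import os
import tempfile


def write_segmented_corpus(sentences, path):
    """Write already-segmented sentences to `path` as UTF-8.

    Assumed layout (not confirmed by the maintainers): one sentence per
    line, tokens joined by a single space.
    """
    with open(path, "w", encoding="utf-8") as f:
        for tokens in sentences:
            f.write(" ".join(tokens) + "\n")


def check_corpus(path):
    """Read the file back as UTF-8 and count non-empty lines and tokens.

    Raises UnicodeDecodeError if the file is not valid UTF-8.
    """
    num_lines = 0
    num_tokens = 0
    with open(path, "r", encoding="utf-8") as f:
        for line in f:
            tokens = line.split()
            if tokens:
                num_lines += 1
                num_tokens += len(tokens)
    return num_lines, num_tokens


# Toy, hand-segmented sentences (illustrative only).
toy_sentences = [
    ["我", "喜欢", "自然", "语言", "处理"],
    ["今天", "天气", "很", "好"],
]

corpus_path = os.path.join(tempfile.mkdtemp(), "toy_corpus.txt")
write_segmented_corpus(toy_sentences, corpus_path)
stats = check_corpus(corpus_path)
print(stats)  # (non-empty lines, total tokens)
```

Running `check_corpus` on your own file before training is a cheap way to catch a non-UTF-8 encoding or an unsegmented corpus (a line that comes back as a single very long token).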

@DouTong

DouTong commented Jul 12, 2018

Hello, the download link for the Chinese corpus no longer works.

@buaasky
Author

buaasky commented Jul 12, 2018

@zhezhaoa
OK, thanks! I'll try again with my own corpus.

@DouTong
The Baidu Netdisk link does seem to download if you use the desktop client, though it is very slow.

@HongyanJiao

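Thanks for sharing the code. Piggybacking on this thread to ask: what format is the Chinese corpus in? I just want to build a small dataset and get things running.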
