For entering text into the Chinese Text Project (《中國哲學書電子化計劃》)
Updated May 22, 2024 - C#
The Jieba Chinese Word Segmentation Implemented in Rust
A convenient Chinese word segmentation tool
GUI application for Chinese word segmentation
Chinese word segmentation (中文分词)
SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm
Chinese tokenizer based on nodejieba and pullword
100+ pre-trained Chinese word vectors
High-performance Chinese tokenizer with GBK and UTF-8 charset support, based on the MMSEG algorithm and written in ANSI C. Its fully modular implementation can be easily embedded in other programs such as MySQL, PostgreSQL, and PHP.
Jcseg is a lightweight NLP framework developed in Java. It provides CJK and English segmentation based on the MMSEG algorithm, plus keyword extraction, key sentence extraction, and summary extraction based on the TextRank algorithm. Jcseg has a built-in HTTP server and search modules for Lucene, Solr, Elasticsearch, and OpenSearch.
Elasticsearch analysis plugin of ICTCLAS
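Exposes jieba, SnowNLP, and pkuseg as an HTTP API web service using Flask.
A tiny Chinese word segmentation engine covering a full range of algorithms | A micro tokenizer for Chinese
A performance-optimized version of Jiebago that supports loading dictionaries from an io.Reader
The pkuseg toolkit for multi-domain Chinese word segmentation
From jieba segmentation to BERT-wwm: a step-by-step introduction to the world of Chinese NLP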
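A PyTorch implementation of BiLSTM / BERT / RoBERTa (+ BiLSTM + CRF) models for Chinese word segmentation (中文分词).
MONPA (罔拍) is a multi-task model that provides Traditional Chinese word segmentation, part-of-speech tagging, and named entity recognition
✂️ A simple version of jieba word segmentation implemented in 100 lines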
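Several entries above are dictionary-based segmenters in the style of jieba. As a rough illustration of that general idea, and not the code of any listed project, here is a minimal Python sketch that builds a graph of dictionary matches and picks the split with the highest total log-probability using dynamic programming. The dictionary `TOY_FREQ`, its frequencies, and the function names `build_dag` and `segment` are all made up for this example.

```python
import math

# Toy dictionary of word -> frequency. These entries and counts are
# invented for illustration and do not come from any real lexicon.
TOY_FREQ = {
    "中文": 50,
    "分词": 40,
    "工具": 30,
    "中": 10,
    "文": 10,
    "分": 10,
    "词": 10,
    "工": 5,
    "具": 5,
}


def build_dag(sentence, freq):
    """Map each start index to the end indices of dictionary words starting there."""
    dag = {}
    n = len(sentence)
    for i in range(n):
        ends = [j for j in range(i, n) if sentence[i:j + 1] in freq]
        # Fall back to a single character when no dictionary word matches.
        dag[i] = ends or [i]
    return dag


def segment(sentence, freq=TOY_FREQ):
    """Split a sentence along the path with the highest total log-probability."""
    n = len(sentence)
    if n == 0:
        return []
    log_total = math.log(sum(freq.values()))
    dag = build_dag(sentence, freq)
    # best[i] = (score of the best split of sentence[i:], end index of its first word)
    best = {n: (0.0, n)}
    for i in range(n - 1, -1, -1):
        best[i] = max(
            # Unknown single characters are treated as having frequency 1.
            (math.log(freq.get(sentence[i:j + 1], 1)) - log_total + best[j + 1][0], j)
            for j in dag[i]
        )
    # Walk the recorded choices from left to right to recover the words.
    words = []
    i = 0
    while i < n:
        j = best[i][1]
        words.append(sentence[i:j + 1])
        i = j + 1
    return words


print(segment("中文分词工具"))
```

Real segmenters add much more on top of this, such as large frequency dictionaries and statistical models for out-of-vocabulary words; the sketch only covers the dictionary lookup and path selection.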