27182812/NLP-dataset

NLP Resource Collection

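A collection of NLP-related resources, including learning materials, model implementations, and datasets, recording the excellent resources I have come across while learning. Continuously updated.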

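An excellent project collecting Chinese natural language processing resources, including NLP tools, corpora, study groups and competitions, and learning materials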

https://github.com/crownpku/Awesome-Chinese-NLP

Corpora

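Chinese NLP corpora and datasets covering sentiment/opinion/review polarity analysis, Chinese named entity recognition, recommender systems, and FAQ question answering https://github.com/SophonPlus/ChineseNlpCorpus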

Search all Chinese NLP datasets https://github.com/CLUEbenchmark/CLUEDatasetSearch

Psychological counseling Q&A corpus https://github.com/chatopera/efaqa-corpus-zh?spm=5176.12282016.0.0.416e7dc0g0sg7R

Zhihu roundup of available Chinese corpora https://www.zhihu.com/question/44764422

Chinese NLP dataset search https://www.cluebenchmarks.com/dataSet_search.html

Emotional dialogue dataset https://www.biendata.xyz/ccf_tcci2018/datasets/ecg/

Chinese open knowledge graph (OpenKG) http://openkg.cn/home

A grab bag of everything, very comprehensive

https://github.com/fighting41love/funNLP

Learning materials

All materials for the Stanford CS224N course (2019) https://github.com/zhanlaoban/CS224N-Stanford-Winter-2019

An introductory guide to NLP research, by Liu Zhiyuan https://github.com/zibuyu/research_tao

Natural language processing algorithms and practice https://github.com/nlpinaction/learning-nlp

July (七月) NLP course notes https://codingcat.cn/pages/608d0a/

PyTorch tutorial in Chinese: chatbot tutorial https://pytorch.apachecn.org/docs/1.0/chatbot_tutorial.html

Baidu PaddleNLP course: https://aistudio.baidu.com/aistudio/education/group/info/24177 and https://aistudio.baidu.com/aistudio/projectdetail/1978303

Find the open-source code for a paper

https://www.paperswithcode.com/

Leaderboards across NLP tasks

https://nlpprogress.com/

A group blog by NLP enthusiasts in China

https://www.52nlp.cn/

Code implementations

A question answering system in practice, made of two parts: a recall model based on TF-IDF retrieval and a fine-ranking model based on a CNN https://github.com/WenRichard/Customer-Chatbot

CakeChat: an emotional generative dialogue system https://www.ctolib.com/lukalabs-cakechat.html

Make your own Rick Sanchez (bot) with Transformers and DialoGPT fine-tuning: https://colab.research.google.com/drive/15wa925dj7jvdvrz8_z3vU7btqAFQLVlG#scrollTo=7KrNfVNueNhR

Emotion analysis in conversations: https://github.com/declare-lab/conv-emotion

Basic implementations of multi-turn dialogue: https://github.com/gmftbyGMFTBY/MultiTurnDialogZoo
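The TF-IDF recall stage mentioned for the question answering system above can be illustrated with a minimal sketch. This is not the code from Customer-Chatbot; the function names (`tokenize`, `build_index`, `recall`), the smoothed IDF formula, the whitespace tokenizer, and the toy FAQ entries are all assumptions made for illustration. A real Chinese system would need a word segmenter, and the candidates returned here would then be passed to a re-ranking model such as a CNN.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: whitespace tokenization; Chinese text needs a word segmenter.
    return text.lower().split()


def build_index(questions):
    # Term counts per stored question.
    docs = [Counter(tokenize(q)) for q in questions]
    n = len(docs)
    # Document frequency: number of questions containing each term.
    df = Counter(term for doc in docs for term in doc)
    # Smoothed inverse document frequency (one common variant, assumed here).
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    vectors = [{t: tf * idf[t] for t, tf in doc.items()} for doc in docs]
    return idf, vectors


def cosine(a, b):
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def recall(query, idf, vectors, top_k=2):
    # Build the query vector from terms seen in the index only.
    tf = Counter(tokenize(query))
    qv = {t: c * idf[t] for t, c in tf.items() if t in idf}
    scored = [(cosine(qv, v), i) for i, v in enumerate(vectors)]
    # Highest score first; ties broken by position in the FAQ list.
    scored.sort(key=lambda s: (-s[0], s[1]))
    # Keep only candidates that share at least one term with the query.
    return [i for score, i in scored[:top_k] if score > 0]


# Hypothetical FAQ entries, for illustration only.
faq = [
    "how do i reset my password",
    "how do i change my shipping address",
    "what is the refund policy",
]
idf, vectors = build_index(faq)
candidates = recall("reset password", idf, vectors)
print([faq[i] for i in candidates])
```

The recall stage favors cheap scoring over every stored question so that the more expensive ranking model only has to score a handful of candidates.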

Applications, tools, and fun stuff

DeepMoji: generates matching emoji for an input sentence https://deepmoji.mit.edu/

A question answering framework for FAQ collections, plus the text semantic matching tool SimNet https://github.com/baidu/AnyQ

An open-source text annotation tool: https://github.com/doccano/doccano

Bad Words: https://github.com/LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words

Articles

Papers on pre-trained models: https://github.com/thunlp/PLMpapers

A brief overview of emotion recognition and generation in conversation https://blog.csdn.net/sdu_hao/article/details/106855262

A paper list for aspect based sentiment analysis https://github.com/jiangqn/Aspect-Based-Sentiment-Analysis

A Paper List for Style Transfer in Text: https://github.com/fuzhenxin/Style-Transfer-in-Text

awesome-emotion-recognition-in-conversations: https://github.com/declare-lab/awesome-emotion-recognition-in-conversations

How to build a state-of-the-art conversational AI with transfer learning: https://medium.com/huggingface/how-to-build-a-state-of-the-art-conversational-ai-with-transfer-learning-2d818ac26313

The Illustrated Transformer (Chinese version, a very good explanation of the Transformer): https://blog.csdn.net/yujianmin1990/article/details/85221271

Awesome-Efficient-PLM: https://github.com/TobiasLee/Awesome-Efficient-PLM

Cloud servers

Matpool (矩池云) https://www.matpool.com/

Hengyuan Zhixiang Cloud (恒源智享云) https://gpushare.com/
