Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

大佬,table-unet的训练样本怎么标注? #1

Open
deping-1 opened this issue Jul 20, 2020 · 9 comments
Open

大佬,table-unet的训练样本怎么标注? #1

deping-1 opened this issue Jul 20, 2020 · 9 comments

Comments

@deping-1
Copy link

No description provided.

@daaidouya
Copy link
Owner

chineseocr/table-ocr#1
横竖线分别标注
还可以参考这篇文章:https://cloud.tencent.com/developer/article/1452973

@deping-1
Copy link
Author

有没有标注的样图

@deping-1
Copy link
Author

标注一张图片都很慢,大概需要标注多少张图片,模型效果才较好呢?

@daaidouya
Copy link
Owner

标注一张图片都很慢,大概需要标注多少张图片,模型效果才较好呢?

目前我是拿TableBank训练的,用传统方法识别出来横竖线条再手动挑选出来识别正确的,一千张就效果还可以了

@deping-1
Copy link
Author

你的意思是 用opencv检测横竖线条?那你用unet或者其他深度学习方法做过表格识别和重构么?

@daaidouya
Copy link
Owner

你的意思是 用opencv检测横竖线条?那你用unet或者其他深度学习方法做过表格识别和重构么?

是拿这个结果去制作unet的数据集,可以不用手动标注。我也是才接触表格识别

@deping-1
Copy link
Author

噢,可以直接使用传统方法识别横竖线,生成图片作为unet的训练样本图片?感谢大佬!

@daaidouya
Copy link
Owner

噢,可以直接使用传统方法识别横竖线,生成图片作为unet的训练样本图片?感谢大佬!

嗯是的

@gjj123
Copy link

gjj123 commented Jan 3, 2023

刚做表格线检测,有些困惑,如果图片上有不是表格线的线,比如文字下划线,页眉页脚画线这些,标注吗?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants