Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others

Categories

0 votes
369 views
in Technique[技术] by (71.8m points)

machine learning - Aspect-based text labeling for named entity recognition in AWS sagemaker

I'd like to know the best way to label text in a machine learning training data set. I'm using Amazon Sagemaker ground Groundtruth to train named entity recognition (NER). Here's an example:

I walked for 30 minutes and then slept for 60 minutes.

In this example, I'd like the NER to recognize that the action "walked" is associated with "30 minutes" and "slept" is for "60 minutes"

If Sagemaker permitted nested labels, then one strategy I could do is to label "walked for 30 minutes" as an event, then nested under that this event label, "walk" could be labeled an action and 30 minutes "duration".

As far I can tell, Sagemaker doesn't allow nested labels (all words can only be associated with a single label).

What is a good labeling strategy for a training dataset that that similar numbers (minutes) are associated with different parts of a sentence or part of the text?

Feel free to let me know how I can make this question more clear. I'm new to creating training datasets.

Thank you!

question from:https://stackoverflow.com/questions/65910179/aspect-based-text-labeling-for-named-entity-recognition-in-aws-sagemaker

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
Welcome To Ask or Share your Answers For Others

1 Reply

0 votes
by (71.8m points)
Waitting for answers

与恶龙缠斗过久,自身亦成为恶龙;凝视深渊过久,深渊将回以凝视…
OGeek|极客中国-欢迎来到极客的世界,一个免费开放的程序员编程交流平台!开放,进步,分享!让技术改变生活,让极客改变未来! Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Click Here to Ask a Question

...