brat 淘气鬼快速标注工具-0001-迷你简介-淘气鬼什么意思?

2023-08-08 17:04:41

 

0、背景

研究一下淘气鬼快速标注工具~

(1)本系列文章

首篇暂无~

1、淘气鬼迷你简介 - mini-introduction to brat

brat is a web-based tool for text annotation; that is, for adding notes to existing text documents.

淘气鬼是一个基于网页的文本标注工具;也就是为已有文本文档添加标注。

brat is designed in particular for structured annotation, where the notes are not freeform text but have a fixed form that can be automatically processed and interpreted by a computer.

淘气鬼设计来做结构化标注,也就是标注不是自由格式的文本,他们有固定的格式可以被计算机自动处理和解释。

the following screenshot shows a simple example where a sentence has been annotated to identify mentions of some real-world entities (things) and their types, and a relation between two.

下面的截图展示了一个简单的示例,一个句子被标注来识别真实世界实体(事物)以及他们的类型,以及二者之间的关系。

example annotations (following in part the ACE 2005 entity and relation annotation guidelines)

配图说明:标注示例(遵循 ACE 2005 实体和关系标注指引)

this example illustrates two basic categories of annotation:

该示例展示了 2 中标注的基本类型:

* text span annotations, such as those marked with the Organization and Person types in the example

01.文本标注,实例中这些标注为组织 O 和人物 P 类型的

* relation annotations, such as the Family relation in the example

02.关系标注,实例中标注为家庭 F 关系的

the simple typed text span category is suitable for creating annotations for named entity recognition, and binary relations for simple relational information extraction tasks, among others.

这个简单的文本类型适用于为命名实体识别 NER 创建标注,为信息抽取任务创建简单的二分分类。

brat also supports the annotation of n-ary associations that can link together any number of other annotations participating in specific roles. This category of annotation can be used for example for event annotation, such as TRANSFER in the following example:

淘气鬼同样支持多关系标注,也就是可以连接任意数量的其他标注到特定角色。这种类型的标注可以用作事件标注,例如下例中的转账 T 关系:

example annotations (following in part the ACE 2005 entity, relation and event annotation guidelines)

配图说明:标注示例(遵循 ACE 2005 实体、关系和事件标注指引)

the detailed types and properties of other annotations can be further specified through the use of attributes that can be set on annotations, for example marking an event as being factual or speculative, or marking an entity mention as referring to a group or an individual.

标注的详细类型和属性可以通过使用特定的属性来指定标注,例如将事件标注为事实的或者推测的,或者一个实体指的是一组还是个体。

to allow the unique identification of the real-world entities referred to by specific text expressions, brat supports also normalization annotations (brat v1.3 (Crunchy Frog) and newer) that associate other annotations with entries in resources such as Wikipedia:

通过允许指定现实世界实体唯一识别文本标识,淘气鬼也支持归一的标注(淘气鬼版本 1.3 (松脆的青蛙-炸了一下嘛?)及更新版本)可以将标注关联到实体资源,例如到维基百科:

information popup for normalized annotation showing information from Wikipedia (image © Andrés Monroy, licensed CC-BY-SA)

配图说明:展示来自于维基百科的归一化标注弹框信息(图片版权 © AM CC-BY-SA)

finally, although not a primary focus of the tool, brat does also allow freeform text "notes" to be added to an annotation.

最后,不是工具主要的功能点,淘气鬼也同样支持自由格式的文本记录作为标注添加。

the applied categories of annotations, their types, and the constraints regarding their use (for example, that a Family relation must always connect annotations of the Person type) are all fully configurable, allowing brat to be applied to nearly any text annotation task.

标注应用的类别、他们的类型,以及使用的约束(例如:家庭 F 关系必须连接的是人物 P 类型)都是完全可配置的,这也成就了淘气鬼可以用于几乎任何文本标注任务。

brat also implements a number of features relying on natural language processing techniques to support human annotation efforts.

淘气鬼也实现了一系列的依赖自然语言处理 NLP 技术来支持人工标注。

2、后记

却道天凉好个秋

秋天来了嘛

有点凉

~


以上就是关于《brat 淘气鬼快速标注工具-0001-迷你简介-淘气鬼什么意思?》的全部内容,本文网址:https://www.7ca.cn/baike/60838.shtml,如对您有帮助可以分享给好友,谢谢。
标签:
声明

排行榜