kyoto_reader.base_phrase module

class kyoto_reader.base_phrase.BasePhrase(tag: pyknp.knp.tag.Tag, dmid_offset: int, dtid: int, sid: str, doc_id: str, parent: Optional[BasePhrase] = None, children: Optional[List[BasePhrase]] = None)

Bases: object

A class representing a base phrase that appears in a sentence.

tag

Tag object in pyknp.

Type: Tag
sid

Sentence ID.

Type: str
dtid

Document-wide tag ID.

Type: int
content_dmid

Document-wide morpheme ID of the content word in the base phrase.

Type: int
parent

Dependency parent.

Type: Optional[BasePhrase]
children

Dependency children.

Type: List[BasePhrase]
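
The parent and children attributes together describe a dependency tree. The following is a minimal sketch of walking such a tree. It does not use kyoto_reader or pyknp: ToyPhrase, attach, ancestors, and descendants are hypothetical names introduced only for illustration, and the toy sentence is made up.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional


@dataclass(eq=False)
class ToyPhrase:
    """Hypothetical stand-in that mimics only `parent` and `children`."""
    label: str
    parent: Optional["ToyPhrase"] = None
    children: List["ToyPhrase"] = field(default_factory=list)


def attach(child: ToyPhrase, parent: ToyPhrase) -> None:
    """Link a child to its dependency parent in both directions."""
    child.parent = parent
    parent.children.append(child)


def ancestors(phrase: ToyPhrase) -> Iterator[ToyPhrase]:
    """Yield dependency parents, from the nearest one up to the root."""
    node = phrase.parent
    while node is not None:
        yield node
        node = node.parent


def descendants(phrase: ToyPhrase) -> Iterator[ToyPhrase]:
    """Yield every dependent of the phrase in depth-first order."""
    for child in phrase.children:
        yield child
        yield from descendants(child)


# Toy sentence: "太郎が" and "本を" both depend on "読んだ".
taro, hon, yonda = ToyPhrase("太郎が"), ToyPhrase("本を"), ToyPhrase("読んだ")
attach(taro, yonda)
attach(hon, yonda)

ancestor_labels = [p.label for p in ancestors(taro)]
descendant_labels = [p.label for p in descendants(yonda)]
```

The same two helper functions should work on any object that exposes parent and children as documented above, since they rely only on those two attribute names.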
__init__(tag: pyknp.knp.tag.Tag, dmid_offset: int, dtid: int, sid: str, doc_id: str, parent: Optional[BasePhrase] = None, children: Optional[List[BasePhrase]] = None)
Parameters:
  • tag (Tag) – Tag object in pyknp.
  • dmid_offset (int) – Document-wide morpheme ID of the previous morpheme.
  • dtid (int) – Document-wide tag ID.
  • sid (str) – Sentence ID.
  • doc_id (str) – Document ID.
  • parent (Optional[BasePhrase]) – Dependency parent.
  • children (Optional[List[BasePhrase]]) – Dependency children.
core

A core expression without ancillary words.

dmid

Document-wide morpheme ID.

dmids

A list of document-wide morpheme IDs.

mrph2dmid

A mapping from morpheme to its document-wide ID.
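
As a rough illustration of how dmids and mrph2dmid relate, the sketch below numbers the morphemes of one base phrase with consecutive document-wide IDs. It is a toy model, not the library's implementation: ToyMorpheme, assign_dmids, and first_dmid are hypothetical names, and the assumption that IDs are consecutive from a given starting value (and how that value relates to dmid_offset) is not stated on this page.

```python
from typing import Dict, List


class ToyMorpheme:
    """Hypothetical stand-in for a morpheme; hashable by object identity."""

    def __init__(self, surf: str) -> None:
        self.surf = surf


def assign_dmids(mrphs: List[ToyMorpheme], first_dmid: int) -> Dict[ToyMorpheme, int]:
    """Map each morpheme to a consecutive ID (assumed numbering scheme)."""
    return {mrph: first_dmid + i for i, mrph in enumerate(mrphs)}


# A base phrase "本を" made of two morphemes, starting at document-wide ID 5.
mrphs = [ToyMorpheme("本"), ToyMorpheme("を")]
mrph2dmid = assign_dmids(mrphs, first_dmid=5)

# The list of IDs follows the insertion order of the mapping.
dmids = list(mrph2dmid.values())
```

Keying the mapping by the morpheme object rather than by its surface string keeps two morphemes with the same surface form distinct.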

mrph_list() → List[pyknp.juman.morpheme.Morpheme]

A list of morphemes.

mrphs

A list of morphemes.

surf

A surface expression.

tid

Tag ID in pyknp.
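
The attributes and properties listed above can be read without knowing how the object was built. The sketch below formats a one-line summary from sid, dtid, tid, surf, and core. The summarize function is a hypothetical helper, and the SimpleNamespace object with made-up values stands in for a real BasePhrase so that the example runs without pyknp installed.

```python
from types import SimpleNamespace


def summarize(bp) -> str:
    """Build a one-line summary from the documented attribute names."""
    return f"{bp.sid} dtid={bp.dtid} tid={bp.tid} surf={bp.surf} core={bp.core}"


# Stand-in object with hypothetical values; a real BasePhrase is assumed
# to expose the same attribute names.
toy = SimpleNamespace(sid="S-001", dtid=7, tid=2, surf="本を", core="本")
line = summarize(toy)
```

Here surf includes the particle while core holds only the content word, which matches the description of core as an expression without ancillary words.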