Natural Language Processing Platform The Árni Magnússon Institute for Icelandic Studies

Malvinnsla API

Malvinnsla API (2.0.1)

License: Apache 2.0

The Árni Magnússon Institute`s Natural Language Processing Platform provides an API where basic text and natural language processing tools for Icelandic can be accessed.

PoS-tag and Lemmatize

Takes raw Icelandic text as input and returns the text in tokens, along with PoS tags and lemmas

Request Body schema:
required
text
string

The text to be PoS-tagged and/or lemmatized

lemma
boolean

Boolean flag which, when true, returns the tokens` lemmas, along with the tokens` PoS tags. The lemmatizer used is Nefnir.

expand_tag
boolean

Boolean flag which, when true, returns a JSON-formatted string, containing a human-readable version of the PoS tags` morphological information.

Responses

Request samples

Content type
{
  • "text": Hér er setning. Hér er önnur.\nSvo er hægt að nota línubil líka.,
  • "lemma": true,
  • "expand_tag": true
}

Response samples

Content type
application/json
{
  • "submitted": string,
  • "sentences": [
    ]
}

Segment words

Request Body schema: application/json
required
One of
text
required
string

The text which contains the words to be segmented.

hyphenation_mode
required
string
Value: pattern

The method to be used for segmenting words.

hyphen_type
required
string
Enum: soft hard custom split

The method for word segmentation (inserting characters). hard inserts a hyphen (-), soft inserts an invisible hyphen, custom inserts a custom character, split returns the segmented words without any inserted characters.

hyphen_character
string or null

Automatically

Responses

Request samples

Content type
application/json
Example
{
  • "text": Þessi setning inniheldur orð sem á að skipta upp.,
  • "hyphenation_mode": pattern,
  • "hyphen_type": hard,
  • "hyphen_character": string
}

Response samples

Content type
application/json
{
  • "sentences": [
    ]
}
NLP tools