Tagged Norwegian Bokmål texts from NBdigital

Open data API in a single place

Provided by difi

Get early access to Tagged Norwegian Bokmål texts from NBdigital API!

Let us know and we will figure it out for you.

Dataset information

Country of origin
Updated
2016.02.29 00:00
Created
2016.01.12
Available languages
Norwegian
Keywords
språkforskning, korpus, språkbanken, tekst, språkteknologi
Quality scoring
245

Dataset description

This corpus contains 4807 morphologically tagged texts in Norwegian Bokmål from the National Library of Norway's corpus of texts in the public domain. All texts have been published after 1960. The texts were automatically tagged with the Oslo-Bergen tagger (see http://www.tekstlab.uio.no/obt-ny/english/index.html), with syntactic disambiguation. In theory, this should give an accuracy of approximately 96,5%. However, the texts have been digitized and OCR-read automatically (with an average word confidence of approximately 90%); this means the overall accuracy is probably considerably lower. The data is stored as one xml file per text/book, with a simple xml structure. See the documentation file for an example.
Build on reliable and scalable technology
Revolgy LogoAmazon Web Services LogoGoogle Cloud Logo
FAQ

Frequently Asked Questions

Some basic informations about API Store ®.

Operation and development of APIs are currently fully funded by company Apitalks and its usage is for free.
Yes, you can.
All important information such as time of last update, license and other information are in response of each API call.
In case of major update that would not be compatible with previous version of API, we keep for 30 days both versions so you will have enough time to transfer to new version. We will inform you about the changes in advance by e-mail.

Didn't find the API you need?

Let us know and we will figure it out for you.

API Store provides access to European Open Data via scalable and reliable REST API interface.
Copyright © 2024. Made with ♥ by Apitalks