Legal Documents from Norwegian Nynorsk Municipialities

Open data API in a single place

Provided by difi

Get early access to Legal Documents from Norwegian Nynorsk Municipialities API!

Let us know and we will figure it out for you.

Dataset information

Country of origin
Updated
2020.12.04 00:00
Created
2019.10.16
Available languages
Norwegian
Keywords
språkteknologi, språkbanken, språkforskning, tekst, korpus
Quality scoring
245

Dataset description

The texts in this corpus have been collected with the web crawler Veidemann in collaboration with the National Library's Web Archive, based on a revised list of municipalities from the National Association of Nynorsk Municipalities (see lnk.no). The web crawler was set to download documents in pdf format. The resulting collection of documents was then scanned using Google's OCR API (Optical Character Recognition). Although the OCR generally is of high quality, some errors will remain in the material. The resulting corpus is made up of 50.000 documents (including legal documents, minutes from meetings etc.), and contains a total of some 127 million words. About 88.5 million of these are in Norwegian Nynorsk, the rest is mostly Norwegian Bokmål. All the texts in the corpus are classified by language. The corpus is currently published as a json object, where the key is an identifier (URN) for the Veidemann download, and the value is a list of lists of pages in the document with associated page numbers and target form. A text file is also provided, containing a list of the URNs in the corpus. These URNs refer to the website (URL) from which the document was downloaded. The original pdf files and the OCR format are available on request to Språkbanken.
Build on reliable and scalable technology
Revolgy LogoAmazon Web Services LogoGoogle Cloud Logo
FAQ

Frequently Asked Questions

Some basic informations about API Store ®.

Operation and development of APIs are currently fully funded by company Apitalks and its usage is for free.
Yes, you can.
All important information such as time of last update, license and other information are in response of each API call.
In case of major update that would not be compatible with previous version of API, we keep for 30 days both versions so you will have enough time to transfer to new version. We will inform you about the changes in advance by e-mail.

Didn't find the API you need?

Let us know and we will figure it out for you.

API Store provides access to European Open Data via scalable and reliable REST API interface.
Copyright © 2024. Made with ♥ by Apitalks