Texts from Norwegian Wikipedia

Open data API in a single place

Provided by difi

Get early access to Texts from Norwegian Wikipedia API!

Let us know and we will figure it out for you.

Dataset information

Country of origin
Updated
2019.03.22 00:00
Created
2007.06.23
Available languages
Norwegian
Keywords
språkteknologi, språkbanken, tekst, korpus, språkforskning
Quality scoring
245

Dataset description

This corpus is a dump from approximately March 20 2019 of all Wikipedia articles written in Norwegian Bokmål, Norwegian Nynorsk and Northern Sami. The corpus contains 492,864 articles for Norwegian Bokmål, 139,927 for Norwegian Nynorsk and 7626 for Northern Sami. The files are structured as a JSON Array of all the articles as they appear on the web. Each article is a structured element, with one level of "key:value" pairs containing text and metadata. There are eight such key:value pairs per article: - bytelength: length of text in number of bytes - pageid: text identifier - title: title as in Wikipedia - hiddencategories: metadata - text: text as in Wikipedia - revised: audit information - contentcategories: metadata - wikidata: other data An example of the JSON format can be found in the documentation file.
Build on reliable and scalable technology
Revolgy LogoAmazon Web Services LogoGoogle Cloud Logo
FAQ

Frequently Asked Questions

Some basic informations about API Store ®.

Operation and development of APIs are currently fully funded by company Apitalks and its usage is for free.
Yes, you can.
All important information such as time of last update, license and other information are in response of each API call.
In case of major update that would not be compatible with previous version of API, we keep for 30 days both versions so you will have enough time to transfer to new version. We will inform you about the changes in advance by e-mail.

Didn't find the API you need?

Let us know and we will figure it out for you.

API Store provides access to European Open Data via scalable and reliable REST API interface.
Copyright © 2024. Made with ♥ by Apitalks