The Bosnian-Herzegovinian National Corpus (BHNC) is a digital database of reference texts from Bosnia and Herzegovina, encompassing various text types: printed daily and weekly newspapers, scientific publications, online portals, legal texts, parliamentary debates, literary works, and more. The goal is to continuously expand the database with texts ranging from historical ones after World War II to entirely contemporary ones, which will be searchable exclusively for scientific purposes via the CQPweb platform. In addition to a wide range of reference texts, the platform also allows for the installation of smaller corpora to meet the needs of individual researchers and university teaching. Access to the corpus is strictly limited to scientific research and academic education and cannot be used for any commercial purposes.

Corpus Statistics

Total Number of Words in the Corpus

0

Number of Words by Type of Publication

Web portals
0
(75,82%)
Daily Newspapers
0
(19,31%)
Weekly newspapers
0
(2,80%)
Public documents
0
(0,70%)
Scientific publications
0
(0,59%)
Wartime newspapers
0
(0,57%)
Bh. epic literature
0
(0,18%)

Distribution by cities

Sarajevo
0,00%
Banja Luka
0,00%
Mostar
0,00%
Zenica
0,00%
Tuzla
0,00%
Trebinje
0,00%
Bihać
0,00%
Visoko
0,00%
Kalesija
0,00%
Ljubinje
0,00%
Živinice
0,00%
Književnost
0,00%
Bugojno
0,00%
Cazin
0,00%
Ljubuški
0,00%
Kakanj
0,00%
Bosanska Krupa
0,00%

Percentage of Words by Year

0% 2% 4% 6% 8% 10% 12% 14%
Overview of Word Distribution by Year of Publication
0,27% 1940-1950
0,03% 1950-1960
0,04% 1960-1970
0,02% 1970-1980
0,20% 1980-1990
1,12% 1990-2000
1,04% 2000-2004
1,21% 2005
1,24% 2006
1,27% 2007
1,38% 2008
1,49% 2009
2,36% 2010
2,67% 2011
2,31% 2012
2,93% 2013
3,73% 2014
3,88% 2015
4,67% 2016
4,73% 2017
6,50% 2018
6,03% 2019
7,24% 2020
7,59% 2021
8,21% 2022
9,34% 2023
12,25% 2024
6,25% 2025

News

BHNC Surpasses 1.2 Billion Words

With great pleasure, we present the latest and most extensive version to date of the Bosnian and Herzegovinian National Corpus – BHNC – 1 – 2026. At this stage of development, the corpus has achieved a significant milestone...

BHNC: Reached 250 Million Words

With great pleasure, we announce that the Bosnian-Herzegovinian National Corpus (BHNC) has reached an impressive milestone of 250 million words. This significant achievement represents an exceptionally rich...

New Researchers Using BHNC

We are pleased to announce that the following researchers have started using the Bosnian-Herzegovinian National Corpus (BHNC) in their linguistic analyses: 🔹 Aida Kršo (UNSA) – The Interrelation of the Perfect, Past Anterior, and Aorist in the Journalistic Style of the Standard Language...

BHNC Update

The Bosnian-Herzegovinian National Corpus (BHNC) has reached 186,610 texts and 85,732,177 words. An additional 40 million new words are being prepared, further expanding the BHNC. We thank...