The Bosnian-Herzegovinian National Corpus (BHNC) is a digital database of reference texts from Bosnia and Herzegovina, encompassing various text types: printed daily and weekly newspapers, scientific publications, online portals, legal texts, parliamentary debates, literary works, and more. The goal is to continuously expand the database with texts ranging from historical ones after World War II to entirely contemporary ones, which will be searchable exclusively for scientific purposes via the CQPweb platform. In addition to a wide range of reference texts, the platform also allows for the installation of smaller corpora to meet the needs of individual researchers and university teaching. Access to the corpus is strictly limited to scientific research and academic education and cannot be used for any commercial purposes.

Corpus Statistics

Total Number of Words in the Corpus

0

Number of Words by Type of Publication

Daily Newspapers
0
(54.2%)
Scientific Journal
0
(0.5%)
Portal
0
(43.7%)
Weekly Magazine
0
(1.6%)

Percentage of Words by Year

0% 2% 4% 6% 8% 10% 12%
Overview of text distribution by year of publication
0.03% 1987
0.09% 1988
0.00% 1993
0.33% 1994
0.08% 1996
0.00% 1999
0.01% 2000
0.03% 2002
2.40% 2003
0.12% 2004
2.62% 2005
1.97% 2006
1.98% 2007
2.15% 2008
1.71% 2009
6.18% 2010
6.46% 2011
4.37% 2012
4.52% 2013
5.52% 2014
4.96% 2015
5.05% 2016
4.79% 2017
4.75% 2018
5.53% 2019
5.85% 2020
5.58% 2021
6.32% 2022
5.71% 2023
10.80% 2024
0.07% 2025

Vijesti

BHNC: Reached 250 Million Words

With great pleasure, we announce that the Bosnian-Herzegovinian National Corpus (BHNC) has reached an impressive milestone of 250 million words. This significant achievement represents an exceptionally rich...

New Researchers Using BHNC

We are pleased to announce that the following researchers have started using the Bosnian-Herzegovinian National Corpus (BHNC) in their linguistic analyses: 🔹 Aida Kršo (UNSA) – The Interrelation of the Perfect, Past Anterior, and Aorist in the Journalistic Style of the Standard Language...

BHNC Update

The Bosnian-Herzegovinian National Corpus (BHNC) has reached 186,610 texts and 85,732,177 words. An additional 40 million new words are being prepared, further expanding the BHNC. We thank...

BHNC

In 2024, the primary task is the development of the Bosnian-Herzegovinian National Corpus (BHNC), which will consist of newspaper texts from Bosnia and Herzegovina. Seven individuals are working on preparing texts for the BHNC...