Introducing the Polifonia Corpus: explore music concepts and texts from the Polifonia Project with this new web tool

The latest Polifonia tool opens doors of multilingual textual musical heritage resources. Find out what you can do with this tool and how it was developed.

24 February 2023

Università di Bologna (UniBo) launches the long–awaited web application Polifonia Corpus, as part of the Polifonia H2020 project. An interactive dashboard has been created to easily access the Polifonia Corpus and carries a user-friendly design based on a music player. The corpus exists of Wikipedia data (all music-related pages), books (e.g. from the Biblioteca Nacional de España), influential music periodicals (e.g. The Musical Times) and the textual sources belonging to Polifonia pilots BELLS, CHILD, MEETUPS, MUSICBO and ORGANS (e.g. the Dutch organ encyclopaedia). The tool will help linguists, scholars and students to access multi-language music related corpora and to investigate them according to new and different criteria. 

Challenges in multi-lingual corpus and transcending keyword-based search
The new tool interrogates a collection of Italian, English, French, Spanish, German and Dutch sources. The large modularized corpus contains more than 100 million words for each language. A significant part of the sources of the corpus was only available as images or pdf files and Optical Character Recognition (OCR) to convert them in a processable format. The team from UniBo, consisting of Valentina Presutti, Rocco Tripodi, Arianna Graciotti, Marco Grasso, have been using more Natural Language Processing techniques to process the corpus and produce automatic morphosyntactic, semantic and MH-specific annotations. Further, custom APIs enable domain experts, scholars and music professionals to leverage the annotations produced to perform advanced structured queries on the corpus. The available search capabilities transcend standard keyword-based search, and allow for querying the corpus by using the advanced semantic information.

How to use Polifonia Corpus
To search in this corpus, the user first needs to prepare a few parameters. The typical user, linguists or students in the field, can start by entering a keyword in the “Query” section, which should be a musical concept such as ‘guitar’, ‘opera’, or ‘aria’. In the “Type” section users specify how the tool should search: by keyword, lemma, conceptual or named entities search. Then follows the selection of the “Module” to determine the source collection the tool should dig into (Wikipedia, Books, Periodicals or Pilots). The next section asks for selection of the module’s “Language”. The results that follow are sentences in which the input word is found. These sentences are listed in a Key Word In Context (KWIC) index, a well known practice in linguistic corpora querying. The results are listed in concordance lines, which means that they showcase the textual content following and preceding the concordance line keyword. It is also possible to access the full sentence line and its related source.

Release
The Polifonia Corpus is now live and released through the dedicated Polifonia Corpus GitHub repository and the interactive website. The Corpus, metadata and statistics, along with its annotations and interrogation tools are also part of the Polifonia Ecosystem.

Recent News

Polifonia recently released results of 40 months of research and development at the intersection of musicology, semantic web technologies, AI and Music Information Retrieval. And the Polifonia Web Portal, which enhances the discoverability of European musical heritage with Linked Open Data and Knowledge Graphs, can now be explored.

Polifonia recently released results of 40 months of research and development at the intersection of…

3 July 2024

With a strong base in academia, the Polifonia team looks forward to the conference season every year. One of the highlights is the Extended Semantic Web Conference. In this article, a brief review of our participation in this conference and update on our paper output.

With a strong base in academia, the Polifonia team looks forward to the conference season every year.…

26 June 2024

From the beginning of the project, Podiumkunst.net and Polifonia have been in close contact and looked to each other as role models. Our stakeholder Podiumkunst.net reflects on this synergy with a positive outlook. Remco de Boer and Monique in het Veld on the importance of the collaboration and the impact. 

From the beginning of the project, Podiumkunst.net and Polifonia have been in close contact and looked…

11 June 2024

TONALITIES, IReMus’ pilot for musical heritage data project Polifonia, develops tools for the modal-tonal identification, exploration and classification of monophonic and polyphonic notated music from the Renaissance to the twentieth century. Now, the tools are available for use within the TONALITIES Interface for music analysis. Additionally, a patent was recently acquired for this collaborative interface by the IReMus lab.

TONALITIES, IReMus' pilot for musical heritage data project Polifonia, develops tools for the modal-tonal…

29 May 2024

From April 8 to May 6 Polifonia organised their own version of the Eurovision Song Contest, the Polifonia Song Contest: musicians of all levels were challenged to create the ‘soundtrack of our history’ by using samples from the rich collections in the Polifonia project. Today we can announce the winning song.

From April 8 to May 6 Polifonia organised their own version of the Eurovision Song Contest, the Polifonia…

13 May 2024

After four years of development work, the Polifonia project team is excited to present the results. The consortium, consisting of 10 partners from Italy, the Netherlands, France, England and Ireland launches the music discoverability platform ‘Polifonia Web Portal’. In addition, the researchers and developers have also unlocked and linked other music data, developed tools and software that will help musicologists take steps forward in their research on European musical heritage.

After four years of development work, the Polifonia project team is excited to present the results.…

8 May 2024

The Polifonia project formally ended on April 30, which means that the tools and software developed within this 4-year-project are released and ready for use. Today we look at ‘Patterns UI’.

The Polifonia project formally ended on April 30, which means that the tools and software developed…

3 May 2024

Polifonia Song Contest is two weeks in, and will continue for another two weeks. Have you downloaded the sample pack yet?

With two weeks to go until the deadline, the "Polifonia Song Contest" beckons all musicians who find…

22 April 2024

Are you the type of musician that is inspired by old sounds, such as cheerful Irish folk melodies, the majestic resonance of pipe organ concerts, and the timeless chimes echoing from century-old Italian bell towers? Then ‘Polifonia Song Contest’ is your challenge!

Are you the type of musician that is inspired by old sounds, such as cheerful Irish folk melodies, the…

8 April 2024

The consortium is preparing for the last face-to-face consortium meeting of the Polifonia project in April 2024.

The consortium is preparing for the last face-to-face consortium meeting of the Polifonia project in…

4 April 2024

This project has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement N. 101004746