
VU Research Portal

The automatic acquisition of a Dutch lexicon for opinion mining

Maks, E.

2018

Document version

Publisher's PDF, also known as Version of Record

Link to publication in VU Research Portal

Citation for published version (APA)

Maks, E. (2018). The automatic acquisition of a Dutch lexicon for opinion mining.

General rights

Copyright and moral rights for the publications made accessible in the public portal are retained by the authors and/or other copyright owners, and it is a condition of accessing publications that users recognise and abide by the legal requirements associated with these rights.

• Users may download and print one copy of any publication from the public portal for the purpose of private study or research.

• You may not further distribute the material or use it for any profit-making activity or commercial gain.

• You may freely distribute the URL identifying the publication in the public portal.

Take down policy

If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.



Contents

1 Introduction 11

1.1 List of publications . . . 16

1.2 Data and software . . . 17

1.3 Main contributions of this thesis . . . 17

2 Models and Resources for opinion mining 19

2.1 Introduction . . . 19

2.2 Resources in opinion mining . . . 20

2.2.1 Appraisal Framework . . . 20

2.2.2 Computational lexicons for general language . . . 22

2.2.2.1 Princeton WordNet (PWN) . . . 22

2.2.2.2 Cornetto . . . 25

2.2.2.3 FrameNet . . . 26

2.2.3 Polarity lexicons . . . 28

2.2.4 Emotion lexicons . . . 30

2.2.4.1 WordNetAffect . . . 30

2.2.5 Lexicons and multiple affect . . . 31

2.2.5.1 WordNetAffectPlus . . . 31

2.2.5.2 +/- Effect WordNet . . . 32

2.2.6 Lexicons with semantic classification . . . 33

2.2.6.1 SentiFul . . . 33

2.2.7 Corpora with annotated opinions . . . 34

2.2.7.1 MPQA opinion corpus . . . 34

2.2.7.2 Asher: opinions in discourse . . . 36

2.2.8 Summary . . . 37

2.2.8.1 Polarity . . . 38

2.2.8.2 Multiple actor attitude . . . 38

2.2.8.3 Semantic classification . . . 40

2.3 Towards a new lexicon model . . . 42

2.3.1 Polarity . . . 42

2.3.2 Multiple actor attitude . . . 43


2.3.4 Illustrating the model . . . 52

2.4 Conclusions . . . 53

3 Annotation Model 55

3.1 Introduction . . . 55

3.2 Design of the annotation scheme . . . 55

3.2.1 Annotations at word sense level . . . 56

3.2.2 Annotation of attitudes and polarity . . . 56

3.2.2.1 Actor’s attitude (AC) . . . 57

3.2.2.2 Speaker/writer’s attitude (SW) . . . 60

3.2.2.3 SW and AC attitude combined (SW&AC) . . . 61

3.2.2.4 External attitude (extAtt) . . . 62

3.2.2.5 No specific attitude (noAtt) . . . 63

3.2.3 Schematic overview of the annotation scheme . . . 63

3.2.4 Examples of annotations . . . 63

3.2.4.1 Examples with verbs . . . 65

3.2.4.2 Examples with nouns . . . 67

3.2.4.3 Examples with adjectives . . . 68

3.3 Inter-annotator agreement study . . . 70

3.3.1 Composition of a representative sample . . . 70

3.3.2 Annotation task . . . 72

3.3.3 Annotations in numbers . . . 72

3.3.4 Inter-annotator agreement . . . 75

3.3.5 Analysis of disagreements . . . 79

3.3.5.1 Multiple actor attitudes . . . 79

3.3.5.2 External attitude (extATT) . . . 81

3.3.5.3 Positive vs. ’no’ polarity (noPol) . . . 82

3.3.6 Comparison with other studies . . . 83

3.4 Creating the final gold standard . . . 85

3.4.1 Agreement on the simplified annotation schema . . . 86

3.4.2 Agreement on attitude annotations: AC, SW, noAtt . . . 86

3.4.3 Agreement on polarity annotations: positive, negative, noPol . . . 87

3.4.4 Agreement per lexicon dimension . . . 87

3.4.4.1 Lexicon dimensions and polarity . . . 88

3.4.4.2 Lexicon dimensions and attitude . . . 88

3.4.5 Agreement per semantic type . . . 91

3.4.6 Derived gold standard versions . . . 91

3.4.6.1 Gold standards for attitude . . . 93

3.4.6.2 Gold standards for polarity . . . 94

3.5 Conclusions . . . 95

4 Acquisition Methods 99

4.1 Introduction . . . 99

4.2 Background . . . 100

4.2.1 Lexicon-based methods . . . 100

4.2.2 Corpus-based methods . . . 101

4.2.3 Fine-grained classifications . . . 102

4.2.4 Multi- and Cross-lingual Approaches . . . 103

4.3 Evaluation framework . . . 104

4.4 Cross-lingual transfer Method . . . 105

4.4.1 Introduction . . . 105

4.4.2 Datasets . . . 105

4.4.2.1 Sentiwordnet(SWN) . . . 105

4.4.2.2 Dutch WordNet (Cornetto) . . . 106

4.4.2.3 Gold standard . . . 106

4.4.3 Method . . . 107

4.4.4 Results . . . 107

4.4.4.1 Overall results . . . 107

4.4.4.2 Smaller selections . . . 107

4.4.5 Discussion . . . 108

4.5 Wordnet Propagation . . . 109

4.5.1 Introduction . . . 109

4.5.2 Methods . . . 109

4.5.3 Datasets . . . 111

4.5.3.1 Dutch WordNet (Cornetto) . . . 111

4.5.3.2 Seed lists . . . 111

4.5.3.3 Gold standard . . . 112

4.5.4 Results . . . 112

4.5.4.1 Baselines . . . 113

4.5.4.2 Propagation results with different seed lists . . . 113

4.5.4.3 Various wordnet relations . . . 114

4.5.4.4 Various iterations . . . 115

4.5.4.5 Synset-to-word . . . 118

4.5.5 Comparison with other work . . . 119

4.5.6 Discussion . . . 119

4.6 Lexical Feature Approach . . . 121

4.6.1 Introduction . . . 121

4.6.2 Method . . . 121

4.6.3 Data sets and features . . . 122

4.6.3.1 Lexical unit features . . . 122

4.6.3.2 Synset features . . . 125

4.6.3.3 WordNet domain features . . . 126

4.6.3.4 Ontology features . . . 127


4.6.4 Results . . . 128

4.6.4.1 Results on separate features . . . 129

4.6.4.2 Combinations of features . . . 130

4.6.4.3 Results per part-of-speech . . . 131

4.6.4.4 Results on word level . . . 131

4.6.5 Comparison with other work . . . 132

4.6.6 Discussion . . . 134

4.7 Corpus comparison method . . . 135

4.7.1 Introduction . . . 135

4.7.2 Background . . . 135

4.7.3 Datasets . . . 135

4.7.3.1 Corpus composition . . . 135

4.7.3.2 Gold standard . . . 136

4.7.4 Method . . . 136

4.7.4.1 Assumptions . . . 136

4.7.4.2 Lexicon building . . . 137

4.7.5 Results . . . 138

4.7.5.1 Step1: identifying subjective words without distinguishing SW and AC attitude . . . 138

4.7.5.2 Step2: classifying words into SW and AC categories . . . . 139

4.7.6 Discussion . . . 141

4.8 Lexical Pattern method . . . 143

4.8.1 Introduction . . . 143

4.8.2 Background . . . 143

4.8.3 Datasets . . . 144

4.8.3.1 Seed lists . . . 144

4.8.3.2 Dutch N-gram corpus . . . 144

4.8.3.3 Gold standard . . . 144

4.8.4 Method . . . 144

4.8.4.1 Settings . . . 145

4.8.4.2 Finding the best association measure . . . 146

4.8.4.3 Finding the best cut-off point . . . 148

4.8.5 Results . . . 148

4.8.5.1 Results with automatically generated patterns . . . 149

4.8.5.2 Results of high ranking selections . . . 150

4.8.5.3 Results with linguistically motivated patterns . . . 152

4.8.6 Comparison with other work . . . 154

4.8.7 Discussion . . . 155

4.9 Comparing and combining methods . . . 157

4.9.1 Methods for the identification of positive and negative polarity . . . . 157

4.9.2 Methods for the identification of AC and SW attitude . . . 158

4.10 Discussion and conclusions . . . 162

5 Use Cases 167

5.1 Introduction . . . 167

5.2 Polarity in Hotel reviews . . . 168

5.2.1 Introduction . . . 168

5.2.2 Background . . . 168

5.2.3 Hotel review corpus . . . 168

5.2.3.1 Composition of the corpus . . . 168

5.2.3.2 Reviewer ratings and reader ratings . . . 169

5.2.4 Methods . . . 170

5.2.4.1 The dictionary lookup approach . . . 170

5.2.4.2 Machine-learning . . . 171

5.2.5 Results . . . 172

5.2.6 Discussion . . . 174

5.2.7 Conclusions . . . 174

5.3 Finding holders and targets in political news . . . 175

5.3.1 Introduction . . . 175

5.3.2 Background . . . 175

5.3.3 The OPeNER Corpus . . . 175

5.3.3.1 Annotations of opinion entities and relations . . . 176

5.3.4 The use of the lexicons in the opinion mining task . . . 177

5.3.4.1 Lexicon with SW and AC attitude (SWAC-lexicon) . . . 177

5.3.4.2 Polarity lexicon . . . 179

5.3.5 The OPeNER opinion mining system . . . 180

5.3.5.1 Opinion entity extraction . . . 180

5.3.5.2 Opinion relation extraction . . . 180

5.3.6 Results . . . 182

5.3.6.1 Results on identification of entities (Step I) . . . 182

5.3.6.2 Results on identification of relations (Step II) . . . 183

5.3.7 Discussion and conclusions . . . 183

5.4 SW and AC words in lexicon and corpus . . . 187

5.4.1 Introduction . . . 187

5.4.2 Lexical capacity: SW and AC words in the lexicon . . . 187

5.4.3 Lexical usage: SW and AC words in a general language . . . 189

5.4.4 Distribution of SW and AC words in an opinionated corpus . . . 191

5.4.4.1 Corpus composition . . . 191

5.4.4.2 SW/AC distribution in the WNBC corpus . . . 192

5.4.5 Discussion . . . 194

5.5 Discussion and conclusions . . . 196

6 Conclusions 199

7 Bibliography 203


Dankwoord (Acknowledgements) 219

1 Introduction

People have and express views on a nearly infinite variety of subjects. Is Rome the most beautiful city in the world? How do people feel about the Dutch king? What are the best universities in the world? Would Brexit be bad for London's financial centre? For all sorts of reasons, people take a great interest in knowing what views other people hold on subjects like these. The wealth of digitized texts in which people express their opinions and attitudes gives us abundant opportunities to obtain answers to these questions.

The language and style that convey this kind of information are often diverse and complex. Opinions and evaluations come in many forms, such as judgements, allegations, desires, intentions, beliefs and speculations (Wiebe et al. (2005)). Moreover, we can find opinions in many different texts and text genres, such as news, editorials, blogs, forums, reviews and online debates.

Various tools and techniques have been developed for the automatic extraction and interpretation of opinionated information from text. To accomplish this, a method is needed to distinguish between opinionated and non-opinionated pieces of text. Also, a method is needed for classifying expressions into sentiment categories such as positive, negative, and neutral. Consider, for example, the following text in which a reviewer describes his visit to a museum in Rome¹.

(1) A must for both locals and tourists.

This is it! This museum demands time and respect. An extraordinary example of successful conversion of a decommissioned power plant into an amazingly spacious and airy exhibition space, showcasing beautiful ancient sculptures from Rome's imperial times as well as beautiful refined mosaics. Highly recommended, take your time.

The writer clearly wants to show his enthusiasm for the museum described. The review offers a number of expressions that contribute to this purpose, of which successful, amazingly, beautiful, and highly recommended are the most obvious ones. Expressions such as this is it! and demands time and respect surely add to the positive opinion conveyed by this review, but require more context for interpretation. If an automatic analysis relies on the principle of compositionality, that is, considers the review's meaning as the sum of its words, it will certainly be able to classify this review as positive.

¹ https://www.tripadvisor.nl/Attraction_Review-g187791-d19099c0-Reviews-Centrale_
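The compositional view sketched above can be illustrated with a minimal bag-of-words classifier: each word carries a prior polarity taken from a lexicon, and the review's polarity is simply the sum of those priors. The tiny lexicon, the word list, and the zero threshold below are purely illustrative assumptions for this sketch, not the lexicon or method developed in this thesis.

```python
# Minimal illustration of compositionality in polarity classification:
# the text's polarity is taken to be the sum of the prior polarities
# of its individual words. Lexicon entries here are hypothetical.
POLARITY = {
    "successful": 1.0,
    "amazingly": 1.0,
    "beautiful": 1.0,
    "extraordinary": 1.0,
    "recommended": 1.0,
    "decommissioned": -0.5,  # a plausible prior; context overrides it here
}

def classify(text: str) -> str:
    """Sum prior polarities of all lexicon words occurring in the text."""
    words = text.lower().replace(",", " ").replace(".", " ").split()
    score = sum(POLARITY.get(w, 0.0) for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

review = ("An extraordinary example of successful conversion of a "
          "decommissioned power plant into an amazingly spacious and airy "
          "exhibition space. Highly recommended.")
print(classify(review))  # -> positive
```

Note how the single negative prior for decommissioned is outweighed by the positive words, so the review as a whole is classified as positive, exactly as the compositional principle predicts; multiword cues such as this is it!, which require context, are invisible to this word-level scheme.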
