• No results found

A Bigger Fish to Fry: Scaling up the Automatic Understanding of Idiomatic Expressions

N/A
N/A
Protected

Academic year: 2021

Share "A Bigger Fish to Fry: Scaling up the Automatic Understanding of Idiomatic Expressions"

Copied!
2
0
0

Bezig met laden.... (Bekijk nu de volledige tekst)

Hele tekst

(1)

University of Groningen

A Bigger Fish to Fry

Haagsma, Hessel

DOI:

10.33612/diss.131057087

IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please check the document version below.

Document Version

Publisher's PDF, also known as Version of record

Publication date: 2020

Link to publication in University of Groningen/UMCG research database

Citation for published version (APA):

Haagsma, H. (2020). A Bigger Fish to Fry: Scaling up the Automatic Understanding of Idiomatic Expressions. University of Groningen. https://doi.org/10.33612/diss.131057087

Copyright

Other than for strictly personal use, it is not permitted to download or to forward/distribute the text or part of it without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license (like Creative Commons).

Take-down policy

If you believe that this document breaches copyright please contact us providing details, and we will remove access to the work immediately and investigate your claim.

Downloaded from the University of Groningen/UMCG research database (Pure): http://www.rug.nl/research/portal. For technical reasons the number of authors shown on this cover page is limited to 10 maximum.

(2)

Stellingen

behorende bij het proefschrift

A Bigger Fish to Fry

Scaling up the Automatic Understanding of Idiomatic Expressions

van

Hessel Haagsma

1. Individual idioms are rare, which makes automatically processing them dif-ficult. Idioms, as a group, are frequent, which makes being able to process them a crucial NLP problem.

2. The lack of sizeable and varied idiom datasets has hampered previous re-search in many ways. A large, comprehensive dataset is an important step towards solving this.

3. The automatic extraction of potentially idiomatic extractions, an important prerequisite for large-scale corpus building, can be solved using a rule-based approach based on existing NLP tools.

4. Utilising crowdsourcing for difficult tasks like annotating the meaning of po-tentially idiomatic expressions can be successful, but the devil is in the de-tails: breaking down the task into smaller bits and careful instruction are cru-cial.

5. Unsupervised disambiguation approaches have intuitive appeal and show promising results, but never actuallygood results.

6. Deep learning models for the disambiguation of potentially idiomatic expres-sions show a generalisation capability other supervised models have failed to achieve.

7. If natural language had been designed by a logician, idioms would not exist. — Philip Johnson-Laird, 1993

8. Lakoff reports surprising unanimity in informants concerning the prototyp-ical mental image invoked by spilling the beans. For example, most say that the beans are uncooked and in a container about the size of the human head; that they are supposed to be in the container; that the spilling is accidental; that the beans go all over the place and are never easy to retrieve; and that the spill is messy. — Rosamund Moon, 1998

9. The use of spill the beans and kick the bucket as example idioms in scientific publications should be outlawed.

10. beating a dead horse (British, idiomatic): repeatedly rewriting and

Referenties

GERELATEERDE DOCUMENTEN

Now perform the same PSI blast search with the human lipocalin as a query but limit your search against the mammalian sequences (the databases are too large, if you use the nr

To test this assumption the mean time needed for the secretary and receptionist per patient on day 1 to 10 in the PPF scenario is tested against the mean time per patient on day 1

The Participation Agreement creates a framework contract between the Allocation Platform and the Registered Participant for the allocation of Long Term

In addition, in this document the terms used have the meaning given to them in Article 2 of the common proposal developed by all Transmission System Operators regarding

3.3.10.a Employees who can submit (a) medical certificate(s) that SU finds acceptable are entitled to a maximum of eight months’ sick leave (taken either continuously or as

The present text seems strongly to indicate the territorial restoration of the nation (cf. It will be greatly enlarged and permanently settled. However, we must

Land acquisition in order to settle the land claim depends on the availability of land on the market. South African land reform follows the market-led approach. Therefore, there

• You must not create a unit name that coincides with a prefix of existing (built-in or created) units or any keywords that could be used in calc expressions (such as plus, fil,