Separate training for conditional random fields using co-occurrence rate factorization

Academic year: 2021



Zhemin Zhu

CTIT Database Group, University of Twente, Enschede, The Netherlands

Email: z.zhu@utwente.nl

Djoerd Hiemstra

CTIT Database Group, University of Twente, Enschede, The Netherlands

Email: d.hiemstra@utwente.nl

Peter Apers

CTIT Database Group, University of Twente, Enschede, The Netherlands

Email: p.m.g.apers@utwente.nl

Andreas Wombacher

CTIT Database Group, University of Twente, Enschede, The Netherlands

Email: a.wombacher@utwente.nl

Conditional Random Fields (CRFs) are undirected graphical models that are well suited to many natural language processing (NLP) tasks, such as part-of-speech (POS) tagging and named entity recognition (NER). The standard training method for CRFs can be very slow in large-scale applications. As an alternative, piecewise training divides the full graph into pieces, trains them independently, and combines the learned weights at test time. However, piecewise training does not scale well with variable cardinality. In this paper we present separate training for undirected models, based on the novel Co-occurrence Rate Factorization (CR-F). Separate training is a local training method that requires no global propagation. In contrast to directed Markov models such as MEMMs, separate training is unaffected by the label bias problem even though it is a locally normalized method. We run experiments on two NLP tasks, POS tagging and NER. The results show that separate training (i) is unaffected by the label bias problem; (ii) reduces the training time from weeks to seconds; and (iii) obtains results competitive with standard and piecewise training on linear-chain CRFs. Separate training is a promising technique for scaling undirected models for natural language processing tasks. More details can be found at http://eprints.eemcs.utwente.nl/22600/.
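The abstract does not define the co-occurrence rate, so the following is only a minimal sketch under stated assumptions. It assumes the definition CR(a; b) = P(a, b) / (P(a) P(b)) and uses a toy two-label chain with made-up probabilities and no observation sequence. It checks that the product of per-position marginals and adjacent-pair co-occurrence rates reproduces the joint probability of a first-order chain, which suggests how locally estimated factors can stand in for a globally normalized model. All names (`LABELS`, `INIT`, `TRANS`, `cr_factorized_prob`) are hypothetical, and this is not the authors' training procedure.

```python
from itertools import product
from math import isclose

# Toy first-order chain over two labels; all numbers are invented for illustration.
LABELS = ("N", "V")
INIT = {"N": 0.7, "V": 0.3}
TRANS = {"N": {"N": 0.4, "V": 0.6}, "V": {"N": 0.8, "V": 0.2}}
LENGTH = 3


def chain_prob(seq):
    """Joint probability of a label sequence under the toy chain."""
    p = INIT[seq[0]]
    for a, b in zip(seq, seq[1:]):
        p *= TRANS[a][b]
    return p


def marginals():
    """Per-position marginals P(y_i), obtained by pushing INIT through TRANS."""
    m = [dict(INIT)]
    for _ in range(LENGTH - 1):
        prev = m[-1]
        m.append({b: sum(prev[a] * TRANS[a][b] for a in LABELS) for b in LABELS})
    return m


def cooccurrence_rate(i, a, b, m):
    """Assumed definition: CR(y_i=a; y_{i+1}=b) = P(a, b) / (P(a) * P(b))."""
    joint = m[i][a] * TRANS[a][b]
    return joint / (m[i][a] * m[i + 1][b])


def cr_factorized_prob(seq, m):
    """Product of single-label marginals and adjacent-pair co-occurrence rates."""
    p = 1.0
    for i, y in enumerate(seq):
        p *= m[i][y]
    for i in range(len(seq) - 1):
        p *= cooccurrence_rate(i, seq[i], seq[i + 1], m)
    return p


M = marginals()
# The two ways of computing the joint agree on every possible label sequence.
for seq in product(LABELS, repeat=LENGTH):
    assert isclose(chain_prob(seq), cr_factorized_prob(seq, M))
```

Each factor here depends only on one label or one adjacent pair, so in a data-driven setting each could be estimated on its own without propagating information across the whole chain.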
