  • Telephone: +31 20 59 89782
  • Room nr: 11a-24
  • E-mail: a.gorog@vu.nl
  • Unit: Faculty of Arts (Language and Communication)
  • Position: Researcher

Office hours: Monday to Friday from 9.00 to 17.00

Research interests

Lexicology, Terminology, Semantics, Cognitive linguistics, Corpus Linguistics and Computational Linguistics.

Research

Project DutchSemCor

Most NLP applications that depend on Word Sense Disambiguation (WSD), such as machine translation, question answering, summarization and terminology extraction, require large sense-tagged corpora along with lexical databases to reach satisfactory results. The number of English-language resources has increased in recent years, but data for other languages remain scarce. Dutch is no exception: there is little semantically annotated corpus material with which to train WSD systems. To overcome this data bottleneck, the DutchSemCor project aims to deliver a one-million-word Dutch corpus that is fully tagged with senses and domain tags from the Cornetto lexical database. Of this corpus, 250K examples are being tagged manually. The remainder will be tagged automatically by three different WSD systems and validated by human annotators. The corpus data is based on existing material collected in the CGN, D-CoI and SoNaR projects. These corpora were parsed and tagged in previous projects and will be extended where necessary to find sufficient examples of less frequent word senses that do not appear in them. At the time of writing, the project is in a preliminary phase: we have begun annotating the corpus material for Dutch nouns.

Project Dutch Terminology Service Centre

The Dutch Terminology Service Centre (Steunpunt Nederlandstalige Terminologie) was founded in 2007 by the Dutch Language Union (Nederlandse Taalunie), a Dutch-Flemish government institution. It functions as a non-commercial information centre for all aspects of terminology and serves the entire Dutch-speaking community. We give advice on terminology research to anyone involved in terminology-related work (companies, organisations, translators, terminologists, teachers, scientists, etc.). Our organisation also maintains the website NedTerm (www.nedterm.org). NedTerm provides information on terminology activities and includes a bibliography of terminological works, an overview of terminology training courses in the Netherlands and Flanders, and information on standardisation issues. We are particularly keen to provide practical information on, and links to, on-line resources (terminological databases, information platforms) and to promote language tools (especially open-source ones) that can be used for terminology work. We also organise terminology training sessions and workshops for translators, as well as master classes for scientists and language experts. Apart from organising in-depth study days, we offer practical solutions to our target group. One example is the development of web courses on terminology management. The courses "Ad hoc terminology" and "Systematic terminology management" are aimed at translators, translation agencies, terminologists, language professionals and master's students; they can be downloaded free of charge from our website or consulted on-line.

Publications

  • Görög, A. "Description of Terms from Cognitive Psychology: Work Method." In: ten Hacken, Pius (ed.), Terminology, Computing and Translation. Tübingen: Narr, 2006.
  • Görög, A., Vossen, P. "Computer Assisted Semantic Annotation in the DutchSemCor Project." In: Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10). Valletta, Malta, 2010.

 


Ancillary activities

ExacTerm

Amsterdam
Freelance translation and terminology services
01 January 2009
Ancillary activities last updated: Amsterdam, 6 February 2012
© Copyright VU University Amsterdam
