- open source software capable of solving almost any text processing proble;
- a mature and extensive community of developers, users, educators, students and scientists;
- a defined and repeatable process for creating robust and maintainable text processing workflowsin active use for all sorts of language processing tasks and applications, including: voice of the customer; cancer research; drug research; decision support; recruitment; web mining; information extraction; semantic annotation
-the result of a €multi-million R&D programme running since 1995, funded by commercial users, the EC, BBSRC, EPSRC, AHRC, JISC, etc.
- used by corporations, SMEs, research labs and Universities worldwide
- the Eclipse of Natural Language Engineering, the Lucene of Information Extraction, the ISO 9001 of Text Mining
- a world-class team of language processing developers...
The European Parliament has agreed to cut part of its translation service, in a bid to shave its budget by some €8.6 million per year.
Lawmakers approved on Tuesday 21 November a report by Stanimir Ilchev, a Bulgarian Liberal MEP, changing the procedural rules for the recording of plenary debates.
As of the next plenary, on 10 December, the Parliament will no longer be required to translate the session into all 23 official languages of the European Union, a process which has proved costly and can take up to four months.
The EU legislative will only record proceedings in their original language, while still being required to translate them following a request by a member state.
N-grams are combinations of 1 or more words. 1: monogram, 2: bi-gram, 3: tri-gram, etc. Rarely is it more than 3, unless looking for a specific slogan or turn of phrase. Words are not taken from any part of speech class, so you’re going to get any and all strings. This is important because often times you can simply filter out all words of a particular part of speech class (nouns, verbs, adjectives, adverbs, etc) to improve your signal-to-noise ratio.
Halliday's work has also contributed many of its analytical tools for the kind of linguistic analysis carried out in Critical Discourse Analysis (CDA). CDA is probably the most comprehensive attempt to develop a theory of the ...
The Department of Linguistics and English Language seeks to appoint two Researchers to work full time on the newly established Centre for Corpus Approaches to Social Science, funded by the Economic and Social Research Council. The posts are offered for five years in the first instance.
You must have relevant research experience in Corpus Linguistics, with a focus on any of the following: corpus based discourse analysis, corpus methods, corpus tool development, statistical analysis. You will pursue research on developing existing and new approaches to the use of corpus linguistics in the social sciences under the direction of Prof. Tony McEnery and Dr. Andrew Hardie.
Alternatively, we could name the microlex after some corpus-linguistics pioneer(s), like Kucera or Francis. Then a frequency of 1 per million words would be (say) 1 kucera; 1 per 10 million words would be 1 deci-kucera; etc.
These corpora were created by Mark Davies, Professor of Linguistics at Brigham Young University. They have many different uses, including: finding out how native speakers actually speak and write; looking at language variation and change; finding the frequencyof words, phrases, and collocates; and designing authentic language teaching materials and resources.The corpora are used by more than 100,000 people each month (more than 200,000 visits), which makes them perhaps the most widely-used corpora currently available. They also serve as the basis for an increasing number of publications by researchers from throughout the world.
Learning collocations is a good idea because they can: a) give you the most natural way to say something: smoking is strictly forbidden is more natural than smoking is strongly forbidden. b) give you alternative ways of saying something, which ...
OPTIMALE (Optimising Professional Translator Training in a Multilingual Europe) is an Erasmus Academic Network involving 70 partners from 32 different European countries (including 27 within the EU).
OPTIMALE will monitor the changing nature of the translation professions in the age of the internet, social networks and increasing automation. It aims to act as a vehicle and stimulus for innovation and high quality in the training of professional translators.
Chuchotage is French for whispering, i.e., whispered interpreting. The Professionals' use of the French term is a reminder of the influence of French interpreters on the development of modern conference interpreting.
“In English, all inanimate objects are neutral. Save, of course, an avid car enthusiast who might say ‘she’s a beast’ while polishing off the bonnet of their pride and joy.
A flower is just a flower – over time society has imbued that flower with supposedly ‘feminine’ traits – fragility, beauty, and to those who know nothing of their own ecosystems (or women, for that matter), for the most part decorative. But the neutral nature of our language leaves room for a shift in this notion – just as once, blue was deemed a ‘delicate and dainty’ colour and pink a ‘more decided and stronger’ one, all the humble flower needs is some heavy marketing campaign and we’ll realise ‘she’s a man after all’…”
University signs agreements to promote French languagePeninsula On-lineHBKU's collaboration efforts through its Translation and Interpreting Institute (TII), with the Institut Français and Sorbonne Nouvelle University Paris – 3's Ecole Supérieure ...
A few weeks ago, the same teen language consultant who warned me that abbreviating words in texting (e.g. "u" = "you", "4″ = "for") is something that only old people do anymore, pointed out that my habit of ending statement-style texts with a period communicates an affect that I probably don't intend.
I was skeptical, but this morning's PhD Comics confirms the generalization (although it's about email, which obviously skews the sample to an older demographic)…