language technology and business internationalization
6.7K views | +0 today
Follow
language technology and business internationalization
The language technology as a very helpful tool for business internationalization
Your new post is loading...
Your new post is loading...
Scooped by Andoni Sagarna Izagirre
Scoop.it!

Tsinghua University Publishes Comprehensive Machine Translation Reading List

Tsinghua University Publishes Comprehensive Machine Translation Reading List | language technology and business internationalization | Scoop.it
Tsinghua Natural Language Processing Group (THUNLP) has published a great reading list for any budding AI researchers whose New Year's resolution is to study machine translation. The list compiles the most influential machine translation papers from the past 30 years, spotlighting the 10 most...
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

A corpus-based approach to Charles Dickens’s use of direct thought presentation

A corpus-based approach to Charles Dickens’s use of direct thought presentation | language technology and business internationalization | Scoop.it
Delving into characters’ minds is not Dickens’s strong suit. On the contrary, Dickens’s figures are best known for their simplicity, being frequently characterized by a repeated use of either a striking phrase that dominates their speech.
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

The Widely-Spoken Languages We Still Can’t Translate Online

The Widely-Spoken Languages We Still Can’t Translate Online | language technology and business internationalization | Scoop.it
People who speak languages missing from Google Translate, Siri, and Wikipedia will face future crises—leaving aid organizations scrambling.
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

Call for Papers: Research in Corpus Linguistics journal

Call for Papers: Research in Corpus Linguistics journal | language technology and business internationalization | Scoop.it
A call for papers has been released for a special issue on Research in Corpus Linguistics for the peer-reviewed journal of the Spanish Association for Corpus Linguistics, AELINCO. Scholars working within the areas of computational linguistics, typology, and lexicology are encouraged to send in...
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

Fast and scalable neural embedding models for biomedical sentence classification | BMC Bioinformatics | Full Text

Fast and scalable neural embedding models for biomedical sentence classification | BMC Bioinformatics | Full Text | language technology and business internationalization | Scoop.it
Biomedical literature is expanding rapidly, and tools that help locate information of interest are needed. To this end, a multitude of different approaches for classifying sentences in biomedical publications according to their coarse semantic and rhetoric categories (e.g., Background, Methods,...
more...
No comment yet.
Rescooped by Andoni Sagarna Izagirre from Language Tech Market News
Scoop.it!

#NLU Poised to Transform How We Work

#NLU Poised to Transform How We Work | language technology and business internationalization | Scoop.it
If language understanding can be automated in a wide range of contexts, it is likely to have a profound effect on many professional jobs. Communication using written words plays a central part in many people’s working lives. But it will become a less exclusively human task if machines learn how to extract meaning from text and churn out reports.

In some ways, this computer assistance is badly needed. Many information workers struggle to handle the steadily increasing amount of information they are expected to monitor and process.

Via LT-Innovate
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

Journal Article: “Quadcopters or Linguistic Corpora: Establishing RDM Services for Small-Scale Data Producers at Big Universities”

The following article was recently published by LIBER Quarterly: The Journal of the European Association of Research Libraries. Title  Quadcopters or Linguistic Corpora: Establishing RDM Services for Small-Scale Data Producers at Big Universities Authors Viola Voß ULB Münster, Germany Göran Hamr...
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

Corpus-Based Teaching of German Compound Nouns and Lexical Bundles for Improving Academic Writing Skills

Corpus-Based Teaching of German Compound Nouns and Lexical Bundles for Improving Academic Writing Skills | language technology and business internationalization | Scoop.it
1. Introduction Foreign language teaching in Master’s degree engineering programs at Peter the Great St Petersburg Polytechnic University (SPbPU) reflects the main trends in language learning for p...
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

Can Corpus Linguistics Help Us in the EAP Classroom?

Professor Dr Anne O’Keeffe delivered her academic talk on “ Can Corpus Linguistics help us in the EAP Classroom ?” Professor Dr. Anne O’Keeffe is Director of...
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

Language Log » Law & Corpus Linguistics Conference

« previous post | next post » [Forwarded from James Heilpern] Call for Papers: The Fourth Annual Law & Corpus Linguistics Conference Deadline: October 10, 2018 Event Date: February 7-9, 2019 Location: Brigham Young University, Provo, UT Organization: Brigham Young University Contact: James Heilpern, heilpernj@law.byu.edu BYU Law School is pleased to announce the Fourth Annual Law & Corpus Linguistics Conference, to be held in Provo, Utah on February 7-9, 2019. The Law School seeks original proposals for papers to be presented at the conference, addressing a broad range of topics related to the emerging discipline of Law & Corpus Linguistics, including (but not limited to), applications of corpus linguistics to constitutional, statutory, contract, patent, trademark, probate, administrative, and criminal law; philosophical, normative, and pragmatic justifications for the use of corpus linguistics in the law; philosophical, normative, and pragmatic criticisms of the use of corpus linguistics in the law; best practices and ethical considerations for the use of corpus linguistics in trial and appellate advocacy; potential applications of corpus linguistics in legislative, regulatory, and contractual drafting; corpus design, especially as it relates to the building of future legal corpora; Law & Corpus Linguistics and statistics; and sociolinguistic insights drawn from corpus linguistics, especially as it applies to the relationship of racial, ethnic, or linguistic minorities to legal and government institutions. The proposal deadline is October 10, 2018. Proposals should include an abstract of no more than 750 words, an outline of the proposed paper, and complete contact information. Please send materials to James Heilpern at heilpernj@law.byu.edu.   August 18, 2018 @ 4:27 am · Filed by Mark Liberman under Announcements Permalink
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

[1810.02156] Neural Networks for Cross-lingual Negation Scope Detection

Download: PDF Other formats (license) Current browse context: cs.CL < prev | next > new | recent | 1810 Change to browse by: cs References & Citations NASA ADS Computer Science > Computation and Language Neural Networks for Cross-lingual Negation Scope Detection Federico Fancellu, Adam Lopez, Bonnie Webber (Submitted on 4 Oct 2018) Negation scope has been annotated in several English and Chinese corpora, and highly accurate models for this task in these languages have been learned from these annotations. Unfortunately, annotations are not available in other languages. Could a model that detects negation scope be applied to a language that it hasn't been trained on? We develop neural models that learn from cross-lingual word embeddings or universal dependencies in English, and test them on Chinese, showing that they work surprisingly well. We find that modelling syntax is helpful even in monolingual settings and that cross-lingual word embeddings help relatively little, and we analyse cases that are still difficult for this task. Comments: Subjects: Computation and Language (cs.CL) Cite as: arXiv:1810.02156 [cs.CL]   (or arXiv:1810.02156v1 [cs.CL] for this version) Submission history From: Federico Fancellu [view email] [v1] Thu, 4 Oct 2018 11:51:47 GMT (139kb,D) Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)

 
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

[1512.04419] Sentence Entailment in Compositional Distributional Semantics

Download: PDF Other formats (license) Current browse context: cs.CL < prev | next > new | recent | 1512 Change to browse by: cs cs.AI math math.CT References & Citations NASA ADS DBLP - CS Bibliography listing | bibtex Computer Science > Computation and Language Sentence Entailment in Compositional Distributional Semantics Esma Balkir, Dimitri Kartsaklis, Mehrnoosh Sadrzadeh (Submitted on 14 Dec 2015 (v1), last revised 9 Oct 2018 (this version, v2)) Distributional semantic models provide vector representations for words by gathering co-occurrence frequencies from corpora of text. Compositional distributional models extend these from words to phrases and sentences. In categorical compositional distributional semantics, phrase and sentence representations are functions of their grammatical structure and representations of the words therein. In this setting, grammatical structures are formalised by morphisms of a compact closed category and meanings of words are formalised by objects of the same category. These can be instantiated in the form of vectors or density matrices. This paper concerns the applications of this model to phrase and sentence level entailment. We argue that entropy-based distances of vectors and density matrices provide a good candidate to measure word-level entailment, show the advantage of density matrices over vectors for word level entailments, and prove that these distances extend compositionally from words to phrases and sentences. We exemplify our theoretical constructions on real data and a toy entailment dataset and provide preliminary experimental evidence. Comments: Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Category Theory (math.CT) MSC classes: 03B65 ACM classes: I.2.7 Journal reference: Ann Math Artif Intell (2018) 82: 189. https://doi.org/10.1007/s10472-017-9570-x DOI: 10.1007/s10472-017-9570-x Cite as: arXiv:1512.04419 [cs.CL]   (or arXiv:1512.04419v2 [cs.CL] for this version) Submission history From: Mehrnoosh Sadrzadeh [view email] [v1] Mon, 14 Dec 2015 17:36:35 GMT (62kb,D) [v2] Tue, 9 Oct 2018 10:49:35 GMT (87kb,D) Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
more...
No comment yet.
Rescooped by Andoni Sagarna Izagirre from Language Tech Market News
Scoop.it!

@Arria 's #NLG Engine Powers Ernst & Young's Global Portal

@Arria 's #NLG Engine Powers Ernst & Young's Global Portal | language technology and business internationalization | Scoop.it

Arria NLG, a provider of Natural Language Generation solutions, has successfully launched of EY's Natural Language Generation portal. This solution allows EY employees to make better and faster decisions and to provide its clients with exceptional client services.


Via LT-Innovate
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

The future for Corpus Linguistics - News

The future for Corpus Linguistics - News | language technology and business internationalization | Scoop.it
International experts gather to discuss future directions in a first for Wales...
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

A cognitive scientist’s perspective on taking the CorpusMOOC | ESRC Centre for Corpus Approaches to Social Science (CASS)

A cognitive scientist’s perspective on taking the CorpusMOOC | ESRC Centre for Corpus Approaches to Social Science (CASS) | language technology and business internationalization | Scoop.it
Rose Hendricks, a researcher at the Frameworks Institute in Washington D.C., shares her experience of taking the CorpusMOOC: ‘I’m a social science researcher and have been curious for a while how we can learn more about human culture and cognition by looking at large collections of language — so I jumped at the opportunity to take the Corpus Linguistics online course by Lancaster University. The course had an great mix of videos, readings, and activities, and covered topics in just the right amount of detail. There was enough information to get a good sense of how corpus linguistics methods can be used in a huge range of ways, from addressing questions in sociolinguistics to developing textbooks, dictionaries, and resources for language learners. Conversations with researchers who use corpus linguistics methods gave us an even deeper sense of the interesting and important topics that benefit from tools to extract patterns from huge amounts of text. Throughout the course, I came up with many ideas I plan to explore with the methods we learned about, especially #LancsBox, a tool that helps researchers analyze and visualize their language data. I would recommend this course to people with any level of background knowledge on the topic — there’s something for everyone.’
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

Xtext - Language Engineering Made Easy! : programming

Xtext - Language Engineering Made Easy! : programming | language technology and business internationalization | Scoop.it
r/programming: Computer Programming...
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

Forensic linguistics courses offered by universities (2016)

Forensic linguistics courses offered by universities (2016) | language technology and business internationalization | Scoop.it
Following my recent compendium of legal translation courses (see here), and in view of the growth in the discipline of forensic linguistics, today’s post aims to give you a flavour of forensic linguistics courses available at universities worldwide.
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

[D] Comparing word similarities from different corpora : MachineLearning

r/MachineLearning:...
more...
No comment yet.
Rescooped by Andoni Sagarna Izagirre from Translation Memory
Scoop.it!

All Articles

All  Articles | language technology and business internationalization | Scoop.it
All 1750+ available MultiLingual magazine articles are listed, dating back to 1990. To read multiple full articles, you must log in.

Via Sergey Rybkin
more...
No comment yet.
Rescooped by Andoni Sagarna Izagirre from Language Tech Market News
Scoop.it!

Machines That Can Complete (#Multilingual) Sentences

In the weeks after the release of OpenAI’s system, outside researchers applied it to conversation. An independent group of researchers used OpenAI’s technology to create a system that leads a competition to build the best chatbot that was organized by several top labs, including the Facebook AI Lab. And this month, Google “open sourced” its Bert system, so others can apply it to additional tasks. Mr. Devlin and his colleagues have already trained it in 102 languages.


Via LT-Innovate
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

SIFR annotator: ontology-based semantic annotation of French biomedical text and clinical notes | BMC Bioinformatics | Full Text

SIFR annotator: ontology-based semantic annotation of French biomedical text and clinical notes | BMC Bioinformatics | Full Text | language technology and business internationalization | Scoop.it
Despite a wide adoption of English in science, a significant amount of biomedical data are produced in other languages, such as French. Yet a majority of natural language processing or semantic tools as well as domain terminologies or ontologies are only available in English, and cannot be readily...
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

Automated Phrase Mining from Massive Text Corpora

Automated Phrase Mining from Massive Text Corpora | language technology and business internationalization | Scoop.it
IEEE Computer Society Digital Library...
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

NameSampo: A Linked Open Data Infrastructure and Workbench for Toponomastic Research (.pdf) –

NameSampo: A Linked Open Data Infrastructure and Workbench for Toponomastic Research (.pdf) – | language technology and business internationalization | Scoop.it
« This paper presents a series of projects where one of the main sources for toponomastic research in Finland, the corpora of 2.7 million place names in the Names Archive database of the Institute …...
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

[1810.02100] Semi-Supervised Methods for Out-of-Domain Dependency Parsing

Download: PDF Other formats (license) Current browse context: cs.CL < prev | next > new | recent | 1810 Change to browse by: cs References & Citations NASA ADS Computer Science > Computation and Language Semi-Supervised Methods for Out-of-Domain Dependency Parsing Juntao Yu (Submitted on 4 Oct 2018) Dependency parsing is one of the important natural language processing tasks that assigns syntactic trees to texts. Due to the wider availability of dependency corpora and improved parsing and machine learning techniques, parsing accuracies of supervised learning-based systems have been significantly improved. However, due to the nature of supervised learning, those parsing systems highly rely on the manually annotated training corpora. They work reasonably good on the in-domain data but the performance drops significantly when tested on out-of-domain texts. To bridge the performance gap between in-domain and out-of-domain, this thesis investigates three semi-supervised techniques for out-of-domain dependency parsing, namely co-training, self-training and dependency language models. Our approaches use easily obtainable unlabelled data to improve out-of-domain parsing accuracies without the need of expensive corpora annotation. The evaluations on several English domains and multi-lingual data show quite good improvements on parsing accuracy. Overall this work conducted a survey of semi-supervised methods for out-of-domain dependency parsing, where I extended and compared a number of important semi-supervised methods in a unified framework. The comparison between those techniques shows that self-training works equally well as co-training on out-of-domain parsing, while dependency language models can improve both in- and out-of-domain accuracies. Comments: Subjects: Computation and Language (cs.CL) Cite as: arXiv:1810.02100 [cs.CL]   (or arXiv:1810.02100v1 [cs.CL] for this version) Submission history From: Juntao Yu [view email] [v1] Thu, 4 Oct 2018 08:41:50 GMT (404kb,D) Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)
more...
No comment yet.
Scooped by Andoni Sagarna Izagirre
Scoop.it!

Revisiting the Disputed Federalist Papers: Historical Forensics with the Chaos Game Representation and AI—Wolfram Blog

Revisiting the Disputed Federalist Papers: Historical Forensics with the Chaos Game Representation and AI—Wolfram Blog | language technology and business internationalization | Scoop.it
Forthcoming scientific paper uses FCGR, machine learning and the Wolfram Language to test authorship attribution for the twelve articles claimed by both James Madison and Alexander Hamilton.
more...
No comment yet.