Academia.edu

Natural Language Processing

84,509 papers
239,113 followers
Natural Language Processing (NLP) is a subfield of artificial intelligence and linguistics that focuses on the interaction between computers and human language. It involves the development of algorithms and models to enable machines to understand, interpret, generate, and respond to natural language in a way that is both meaningful and contextually relevant.
Semantic Annotation of Deverbal Nominalizations in the Spanish AnCora Corpus. Aina Peris (CLiC-UB, Gran Via 585, 08007 Barcelona, aina.peris@ub.edu), Mariona Taulé (CLiC-UB, Gran Via 585, 08007 Barcelona, mtaule@ub.edu), Horaci Rodríguez (TALP-UPC... more
Abstract: This article describes a new resource: AnCora-Nom, a lexicon of Spanish deverbal nominalizations. It currently contains 1,655 lexical entries and 3,094 senses, where each sense is associated with its denotative type... more
Despite the high accuracy offered by state-of-the-art deep natural-language models (e.g. LSTM, BERT), their application in real-life settings is still widely limited, as they behave like a black box to the end user. Hence, explainability... more
A case-based reasoning approach is introduced as a learning technique in the domain of machine translation of natural language. In our approach, syntactic and semantic features are part of the cases in the case base. To implement this,... more
In the past decade, we have observed a growing interest in using technologies such as artificial intelligence (AI), machine learning, and chatbots to provide assistance to language learners, especially in second language learning. By... more
This research work sets out to explore the major distinctions between Arabic and Bangla, two languages with different origins. Comparing and analyzing the various features of these two languages requires huge linguistic expertise in the... more
Proverbs are commonly metaphoric in nature and the mapping across domains is commonly established in proverbs. The abundance of proverbs in terms of metaphors makes them an extremely valuable linguistic resource since they can be utilized... more
In this paper, we combine existing NLP techniques with minimal supervision to build memory tips according to the keyword method, a well-established mnemonic device for second language learning. We present what we believe to be the first... more
We present BRAINSUP, an extensible framework for the generation of creative sentences in which users are able to force several words to appear in the sentences and to control the generation process across several semantic dimensions,... more
This paper describes the system developed for SemEval 2017 Task 6: #HashTagWars: Learning a Sense of Humor. Learning to recognize a sense of humor is an important task for language understanding applications. Different set of features... more
Automatic short answer grading (ASAG) has become part of natural language processing problems. Modern ASAG systems start with natural language preprocessing and end with grading. Researchers started experimenting with machine learning in... more
The term Morphologically Rich Languages (MRLs) refers to languages in which significant information concerning syntactic units and relations is expressed at word-level. There is ample evidence that the application of readily available... more
Translation is the transfer of the meaning of a text from one language to another. It is a means of sharing information across languages and therefore essential for addressing information inequalities. The work of translation was... more
Dialectometry is a discipline devoted to studying the variations of a language across a geographical region. One of its goals is the creation of linguistic atlases capturing the similarities and differences of the language under study... more
This study employed a text mining method to investigate the lexical features of PhD theses, and their dynamic changes, across the natural sciences, social sciences and humanities. Four quantitative indices, i.e. TTR, h-point, R1 and... more
Abstract: This article explores the dynamic relationship between child psychology and linguistics, focusing on how language development both reflects and influences cognitive, emotional, and social growth in children. Drawing on... more
in the publication of the event's proceedings. RESP is a multidisciplinary scientific publication, published monthly in continuous flow, whose aim is to support the intellectual production of knowledge about population health, always... more
This paper presents a strategy for syntactic analysis based on the combination of two different parsing techniques: lexical syntactic tagging and phrase structure syntactic parsing. The basic proposal is to take advantage of the good... more
One of the striking commonalities between languages is the way word frequencies are distributed. Across languages, word frequencies follow a Zipfian distribution, showing a power law relation between a word's frequency and its rank... more
In the current context of rapid and constant evolution of global communication and specialised discourses, the need for devising methods for ensuring both high quality levels of specialised translation and successful translation training... more
Sentiment classification is a special task of sentiment analysis in which a text document is assigned to a category such as positive, negative, or neutral on the basis of subjective information contained in the document. This... more
Machine Translation (MT) is a programmed conversion in which computer software is utilized to convert manuscripts from one Natural Language (like English) to a different Language (such as Hindi). To process any such conversion, through... more
Asian and low-resource language information processing refers to the field of computational linguistics that aims to develop natural language processing (NLP) technologies for languages that have fewer available language resources or are... more
Word segmentation is the first and obligatory task for every NLP application. For inflectional languages such as English, French, and Dutch, word boundaries are simply assumed to be whitespace or punctuation, whereas in various Asian languages,... more
For improving the interaction between students and teachers, it is fundamental for teachers to understand students' learning levels. An intelligent computer system should be able to automatically evaluate students' answers when... more
Recently, deep learning-based test case generation approaches have been proposed to automate the generation of unit test cases. In this study, we leverage Transformer-based code models to generate unit tests with the help of Domain... more
Success in intercultural communication relies strongly on understanding the communicative purposes of interlocutors and the pragmatic meaning of their utterances. Established on numerous cross-cultural studies, politeness is a... more
National University of Uzbekistan, Faculty of Journalism, first-year master's student in Computational Linguistics. Abstract: In this article, the distinctive role of the parallel corpus in text translation, as well as in learning languages and, relative to one another,... more
BACKGROUND: The integration of the rapidly expanding corpus of information about the genome, transcriptome, and proteome, engendered by powerful technological advances, such as microarrays, and the availability of genomic sequence from... more
Text mining studies have started to investigate relations between positive and negative opinions and patients' physical health. Several studies linked the personal lexicon with health and the health-related behavior of the individual.... more
A unit of translation is a source text fragment of any length or nature that piques a translator's interest during translation. Alves and Vale proposed the concept of macro and micro translation units based on pauses and times identified... more
This study is the second part of a three-part analysis (if it meets the review requirements) of emotions carried out by written documents. These documents were collected from eight students who took part in a three-day stock market... more
The native language of a foreign language learner can have an effect on the errors they make because of similarities or differences between the two languages. In order to provide effective error prediction and correction for nonnative... more
It is important for language learners to determine and reflect on their writing errors in order to overcome weaknesses. Each language learner has their own unique writing error characteristics and therefore has different learning needs.... more
This paper explores the idea that intelligence, rather than humanity, serves as the primary force driving evolution, with a particular focus on Artificial General Intelligence (AGI). As AI technology progresses, we must reconsider... more
For many second language learners, learning the target language is supposed to be identical with the mastery of the grammar of that language. When we say "the mastery of the grammar", we refer to the mastery of rules which revolve around... more
Purpose: The ever-increasing penetration of the Internet into our lives has led to an enormous amount of multimedia content being generated online. Textual data contributes a major share of the data generated on the World Wide Web.... more
Around the world, most of the proposed techniques for the identification of sarcasm either take the utterance in isolation or these methods only perform the categorization of the textual data. Very limited work has been done on how to... more
An automatic speech recognition (ASR) system is a technology that allows computers to receive input through spoken words. This technology requires sample words, stored in a database, for the pattern-matching process. There is no reference as... more
This paper presents a comparative analysis of two different approaches for clustering textual data from the 20 Newsgroups dataset. The first approach leverages a Large Language Model (LLM) to classify each text into predefined categories... more
Motivation: The most widely used literature search techniques, such as those offered by NCBI's PubMed system, require significant effort on the part of the searcher, and inexperienced searchers do not use these systems as effectively... more
Parallel and comparable corpora represent a crucial resource for different Natural Language Processing tasks like machine translation, lexical acquisition, and knowledge structuring but are also suitable to be consulted by humans for... more
Milica Mihaljevic, Zavod za hrvatski jezik (Institute of Croatian Language), Zagreb, Croatia. The Structure of the Dictionary Entry in Terminological Dictionaries. The structure of a dictionary entry in terminological dictionaries differs depending on the type and purpose of the dictionary. Some small, translation... more