BRIDGE: Bootstrapping Text to Control Time-Series Generation via Multi-Agent Iterative Optimization and Diffusion Modelling
Pay Attention to Real World Perturbations! Natural Robustness Evaluation in Machine Reading Comprehension
LVPruning: An Effective yet Simple Language-Guided Vision Token Pruning Approach for Multi-modal Large Language Models
Discontinuous named entities in clinical Text: A systematic literature review
Dial2MSA-Verified: A Multi-Dialect Arabic Social Media Dataset for Neural Machine Translation to Modern Standard Arabic
Learning to generate and evaluate fact-checking explanations with transformers
The BEA 2024 shared task on the multilingual lexical simplification pipeline
An extensible massively multilingual lexical simplification pipeline dataset using the MultiLS framework
CantonMT: Cantonese to English NMT platform with fine-tuned models using synthetic back-translation data
Too risky yet not risky enough: the intersecting characteristics, vulnerabilities, harm indicators and guardianship issues associated with seriously harmed missing children
Enriching the metadata of community-generated digital content through entity linking: An evaluative comparison of state-of-the-art models
Which side are you on? A multi-task dataset for end-to-end argument summarisation and evaluation
CantonMT: Cantonese-English neural machine translation looking into evaluations
Natural language satisfiability: Exploring the problem distribution and evaluating transformer-based language models
Our Heritage, Our Stories: developing AI tools to link and support community-generated digital cultural heritage
From outputs to insights: a survey of rationalization approaches for explainable text classification
CantonMT: Cantonese to English NMT platform with fine-tuned models using real and synthetic back-translation data
IDEM: The IDioms with EMotions Dataset for Emotion Recognition
Relation extraction for constructing knowledge graphs: Enhancing the searchability of community-generated digital content (CGDC) collections
TriG-NER: Triplet-Grid Framework for Discontinuous Named Entity Recognition
Multi-Loss Fusion: Angular and Contrastive Integration for Machine-Generated Text Detection
Towards Explainable Multi-Label Text Classification: A Multi-Task Rationalisation Framework for Identifying Indicators of Forced Labour
Bulgarian Grammar Error Correction with Data Augmentation and Machine Translation Techniques
Investigating a Benchmark for Training-set free Evaluation of Linguistic Capabilities in Machine Reading Comprehension
Probing the Uniquely Identifiable Linguistic Patterns of Conversational AI Agents
Refining Predicates for Relation Extraction through Thesaurus Integration
Unsupervised literature mining approaches for extracting relationships pertaining to habitats and reproductive conditions of plant species
CantonMT: Investigating Back-Translation and Model-Switch Mechanisms for Cantonese-English Neural Machine Translation
Aspect-based Sentiment Evaluation of Chess Moves (ASSESS): an NLP-based Method for Evaluating Chess Strategies from Textbooks
Resources for Annotating Hate Speech in Social Media Platforms Used in Ethiopia: A Novel Lexicon and Labelling Scheme
AraTar: A Corpus to Support the Fine-grained Detection of Hate Speech Targets in the Arabic Language
Recognition of Biodiversity-related Named Entities by Fine-tuning General-domain BERT-based Language Models
LanViKD: Cross-Modal Language-Vision Knowledge Distillation for Egocentric Action Recognition
Refining Predicates for Relation Extraction through Thesaurus Integration-Abstract
MMT’s submission for the WMT 2023 quality estimation shared task
Do you hear the people sing? Key point analysis via iterative clustering and abstractive summarisation
PULSAR at MEDIQA-Sum 2023: Large language models augmented by synthetic dialogue convert patient dialogues to medical records
PULSAR: Pre-training with extracted healthcare terms for summarising patients' problems and data augmentation with black-box large language models
A survey of methods for revealing and overcoming weaknesses of data-driven Natural Language Understanding
Team: PULSAR at ProbSum 2023: PULSAR: Pre-training with extracted healthcare terms for summarising patients’ problems and data augmentation with black-box large language models
UniManc at NADI 2023 shared task: A comparison of various T5-based models for translating Arabic dialectical text to Modern Standard Arabic
Timeline: Exhaustive annotation of temporal relations supporting the automatic ordering of events in news articles
Not all quantifiers are equal: Probing transformer-based language models’ understanding of generalised quantifiers
Identifying the limits of transformers when performing model-checking with natural language
Argument mining as a multi-hop generative machine reading comprehension task
Few-shot entity linking of food names
Global information-aware argument mining based on a top-down multi-turn QA model
Training models on oversampled data and a novel multi-class annotation scheme for dementia detection
Natural Language Robot Programming: NLP integrated with autonomous robotic grasping
Learning to Play Chess from Textbooks (LEAP): a Corpus for Evaluating Chess Moves based on Sentiment Analysis
Entity Coreference and Co-occurrence Aware Argument Mining from Biomedical Literature
Reviewer 2 Must Be Stopped: Transformer-Based Approaches for Predicting Paper Acceptance
A Hybrid of Rule-based and Transformer-based Approaches for Relation Extraction in Biodiversity Literature
Towards End-User Development for IoT: A Case Study on Semantic Parsing of Cooking Recipes for Programming Kitchen Devices
Second Report-Our Heritage, Our Stories: Linking and Searching Community-Generated Digital Content to Develop the People's National Collection
Analyzing Sentiments and Topics on Twitter Towards Rising Cost of Living
Are Machine Reading Comprehension Systems Robust to Context Paraphrasing?
Extracting Reproductive Condition and Habitat Information from Text Using a Transformer-based Information Extraction Pipeline
English2BSL: A Rule-Based System for Translating English into British Sign Language
Building an ensemble of transformer models for Arabic dialect classification and sentiment analysis
Towards human-centred explainability benchmarks for text classification
RaFoLa: a rationale-annotated corpus for detecting indicators of forced labour
Incorporating zoning information into argument mining from biomedical literature
Policy-focused Stance Detection in Parliamentary Debate Speeches
Food for Thought: How can we exploit contextual embeddings in the translation of idiomatic expressions?
First Report-Our Heritage, Our Stories: Linking and searching community-generated digital content to develop the people's national collection
Natural language processing for requirements engineering: A systematic mapping study
An investigation of academic perspectives on the ‘circular economy’ using text mining and a Delphi study
Semantics altering modifications for evaluating comprehension in machine reading
Computation of semantic change in scientific concepts: Case study of “circular economy”
IoT Cooking Workflows for End-Users: A Comparison Between Behaviour Trees and the DX-MAN Model
Is the Understanding of Explicit Discourse Relations Required in Machine Reading Comprehension?
Interactive clustering of cooking recipe instructions: Towards the automatic detection of events involving kitchen devices
Automatic Detection of Deaths from Social Networking Sites
Sustainable innovation: analysing literature lineages
wapr.tugon.ph: A Secure Helpline for Detecting Psychosocial Aid from Reports of Unlawful Killings in the Philippines
Sentiment and position-taking analysis of parliamentary debates: a systematic literature review
A framework for evaluation of machine reading comprehension gold standards
ParlVote: A corpus for sentiment analysis of political debates
Beyond leaderboards: A survey of methods for revealing weaknesses in natural language inference data and models
Self-supervised learning of object slippage: An LSTM model trained on low-cost tactile sensors
Origin of the Aromatic Group of Cultivated Rice (Oryza sativa L.) Traced to the Indian Subcontinent
Whose story is it anyway? Automatic extraction of accounts from news articles
Policy preference detection in parliamentary debate motions
Studying the Evolution of the ‘Circular Economy’ Concept Using Topic Modelling
Semantic frame embeddings for detecting relations between software requirements
Towards the Automatic Analysis of the Structure of News Stories
Using Frame Embeddings to Identify Semantically Related Software Requirements
Semantic change in the language of UK parliamentary debates
Understanding the evolution of circular economy through language change
Topic modelling vs distant supervision: a comparative evaluation based on the classification of parliamentary enquiries
Using Prior Knowledge to Facilitate Computational Reading of Arabic Calligraphy
Literature mining on dipterocarps: towards better informed natural regeneration and reforestation in Luzon, Philippines
Data from the paper: Policy Preference Detection in Parliamentary Debate Motions
Count-based semantic model evaluation for the extraction of semantic relations for named bays from a small specialized corpus
Analyzing sentiments expressed on Twitter by UK energy company consumers
Identification of research hypotheses and new knowledge from scientific literature
'Aye' or 'No'? Speech-level Sentiment Analysis of Hansard UK Parliamentary Debate Transcripts
Annotation and detection of drug effects in text for pharmacovigilance
Identifying opinion-topics and polarity of parliamentary debate motions
A sentiment-labelled corpus of Hansard parliamentary debate speeches
Using semantic frames to identify related textual requirements: an initial validation
Towards a corpus of requirements documents enriched with semantic frame annotations
LitPathExplorer: a confidence-based visual text analytics tool for exploring literature-enriched pathway models
Conceptual information extraction for named bays from a specialized corpus
Extracting granular information on habitats and reproductive conditions of Dipterocarps through pattern-based literature analysis
A FrameNet-based approach for annotating natural language descriptions of software requirements
Extending the environment ontology with text-mined habitat mentions
Towards the automatic extraction of plant traits from textual descriptions
Extraction of terms highly associated with named rivers
Stranger Genres: Computationally Classifying Reprinted Nineteenth Century Newspaper Texts
biochem4j: Integrated and extensible biochemical knowledge through graph databases
Using uncertainty to link and rank evidence from biomedical literature for model curation
SciLite: a platform for displaying text-mined annotations as a means to link research articles with biological data
Constructing a biodiversity terminological inventory
A text mining-based framework for constructing an RDF-compliant biodiversity knowledge repository
Ecological niche modelling tool for aquatic life population distribution using maximum entropy model
Argo as a platform for integrating distinct biodiversity analytics tools into workflows for building graph databases
Developing a knowledge base on the habitats and reproductive conditions of Dipterocarps through information extraction
Modelling the Coverage of Dipterocarp Trees in Central Visayas, Philippines
5Ws: What Went Wrong With Word embeddings
CEUR WORKSHOP PROCEEDINGS
Clustering Cancer Drugs According to their Mechanisms of Action
Text mining the history of medicine
Overview of the interactive task in BioCreative V
BioCreative V BioC track overview: collaborative biocurator assistant task for BioGRID
Argo: enabling the development of bespoke workflows and services for disease annotation
Learning to recognise named entities in tweets by exploiting weakly labelled data
Crowdsourcing-based annotation of emotions in Filipino and English tweets
Construction of a Biodiversity Knowledge Repository using a Text Mining-based Framework
A text mining framework for accelerating the semantic curation of literature
Proceedings of the Fifth Workshop on Building and Evaluating Resources for Biomedical Text Mining (BioTxtM2016)
Enhancing semantic search through the automatic construction of a Biodiversity Terminological Inventory
Understanding mass flowering of dipterocarps through semantic occurrence information extraction
Real use cases for Semantic Information from the Mining Biodiversity project
Text Mining Workflows for Indexing Archives with Automatically Extracted Semantic Metadata
Facilitating and promoting web annotation with Argo
The CHEMDNER corpus of chemicals and drugs and its annotation principles
Using text mining techniques to extract phenotypic information from the PhenoCHF corpus
Optimising chemical named entity recognition with pre-processing analytics, knowledge-rich features and heuristics
Supporting the annotation of chronic obstructive pulmonary disease (COPD) phenotypes with text mining workflows
Augmenting the Medical Subject Headings vocabulary with semantically rich variants to improve disease mention normalisation
Semi-automatic curation of chronic obstructive pulmonary disease phenotypes using Argo
Mining the biomedical literature
Adapting ChER for the recognition of chemical mentions in patents
Development of bespoke machine learning and biocuration workflows in a BioC-supporting text mining workbench
Bridging text-mined biomolecular events to pathway models using approximate subgraph matching techniques
Unlocking knowledge in biodiversity legacy literature through automatic semantic metadata extraction
Text-mining-assisted biocuration workflows in Argo
BioC interoperability track overview
Processing biological literature with customizable Web services supporting interoperable formats
Interoperability and Customisation of Annotation Schemata in Argo
A strategy for annotating clinical records with phenotypic information relating to the chronic obstructive pulmonary disease
Enriching the legacy literature with OCR corrections and text-mined semantic metadata
Information extraction from pharmaceutical literature
Facilitating the analysis of discourse phenomena in an interoperable NLP platform
Chemistry-specific features and heuristics for developing a CRF-based chemical named entity recogniser
Extending an interoperable platform to facilitate the creation of multilingual and multimodal NLP applications
Customisable curation workflows in Argo
Towards a better understanding of discourse: integrating multiple discourse annotation perspectives using UIMA
NaCTeM’s BioC modules and resources for BioCreative IV
NaCTeM CTD Web Services
Supporting Discourse Phenomena in an Interoperable NLP Framework
Analysing entity type variation across biomedical subdomains
What's in a Name? Entity Type Variation across Two Biomedical Subdomains
Proceedings of BioNLP 2011 Workshop
Detecting experimental techniques and selecting relevant documents for protein-protein interactions from biomedical literature
Building a coreference-annotated corpus from the domain of biochemistry
Adapting the cluster ranking supervised model to resolve coreferences in the drug literature
Discovering Potential Drugs by Extracting Biological Activities of Natural Products
NaCTeM systems for BioCreative III PPI tasks
Open Science for Foundation Models
VeLAR: Vision-oriEnted Language-Attentive token Reduction for multimodal large language models
2021 IEEE 15th International Conference on Semantic Computing (ICSC) | 978-1-7281-8899-7/21/$31.00 © 2021 IEEE | DOI: 10.1109/ICSC50631.2021.00082
Semi-automatic extraction of processes affecting beaches from a specialized corpus
Third Workshop on Building and Evaluating Resources for Biomedical Text Mining Workshop Programme