August 27, 2024

A new and simple approach extending data expansion-based implicit discourse relation classification

by Frontiers Journals

Discourse relation classification is a fundamental task for discourse analysis, which is essential for understanding the structure and connection of texts. Implicit discourse relation classification aims to determine the relationship between adjacent sentences and is the most challenging in discourse relation classification because it lacks explicit discourse connectives as linguistic cues and sufficient annotated training data.

A promising way is to expand the training data for implicit discourse relations based on easy-to-collect explicit discourse relations. However, the expanded data often involves noise in both the argument pair selection and discourse relation sense assignment process, leading to limited improvements.

A team led by Wei Song propose a novel method for explicit data expansion to address the above challenges. They published their research on 15 August 2024 in Frontiers of Computer Science.

In order to obtain suitable argument pairs, the argument pair type classification (APTC) task is proposed. APTC is a classifier that can distinguish between explicit and implicit argument pairs, and select only those explicit argument pairs that are similar to natural implicit argument pairs for data expansion, so the method can filter out noisy and unsuitable argument pairs for data expansion.

To annotate the senses of expanded argument pairs, a simple label-smoothing strategy is proposed. Instead of assigning a single dominant sense to a discourse connective, a smoothed sense is derived based on the distribution of each sense for that discourse connective.

This can reduce the impact of noisy sense labels that may not match the actual relation between the arguments.

Despite its simplicity, the evaluation results on PDTB 2.0 and PDTB 3.0 demonstrate the effectiveness of the proposed method. It can consistently lead to improvements compared with previous data expansion methods and obtain competitive performance to the state-of-the-art models across datasets and on both the top-level class and the second-level type senses.

The discriminative explicit argument pair selection and the label-smoothing strategy complement and depend on each other to achieve the best performance. The results and analysis confirm that the proposed method extends the data expansion-based implicit discourse relation classification.

More information: Wei Song et al, Discriminative explicit instance selection for implicit discourse relation classification, Frontiers of Computer Science (2023). DOI: 10.1007/s11704-023-3058-2

Provided by Frontiers Journals

Citation: A new and simple approach extending data expansion-based implicit discourse relation classification (2024, August 27) retrieved 31 August 2024 from https://techxplore.com/news/2024-08-simple-approach-expansion-based-implicit.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Adaptive-k: A simple and effective method for robust training in label noisy datasets

3 shares

Feedback to editors

A quantum neural network can see optical illusions like humans do. Could it be the future of AI?

6 hours ago

Pilot plant demonstrates iron-based hydrogen storage feasibility

8 hours ago

Exploring the fundamental reasoning abilities of LLMs

11 hours ago

Research team proposes solution to AI's continual learning problem

18 hours ago

Virtual and augmented reality can temporarily change the way people perceive distances, finds study

Aug 30, 2024

Researchers develop ultra-high efficiency perovskite LEDs by strengthening lattice

Aug 30, 2024

Transparency is often lacking in datasets used to train large language models, study finds

Aug 30, 2024

Morphing facial technology sheds light on the boundaries of self-recognition

Aug 30, 2024

Silicon chip propels 6G communications forward

Aug 29, 2024

Scalable graphene technology could significantly enhance battery safety and performance

Aug 29, 2024

Load comments (0)

A new and simple approach extending data expansion-based implicit discourse relation classification

A quantum neural network can see optical illusions like humans do. Could it be the future of AI?

Pilot plant demonstrates iron-based hydrogen storage feasibility

Exploring the fundamental reasoning abilities of LLMs

Research team proposes solution to AI's continual learning problem

Virtual and augmented reality can temporarily change the way people perceive distances, finds study

Researchers develop ultra-high efficiency perovskite LEDs by strengthening lattice

Transparency is often lacking in datasets used to train large language models, study finds

Morphing facial technology sheds light on the boundaries of self-recognition

Silicon chip propels 6G communications forward

Scalable graphene technology could significantly enhance battery safety and performance

Adaptive-k: A simple and effective method for robust training in label noisy datasets

Attribute augmentation-based label integration for crowdsourcing

Health care professionals exhibit gender bias

Improving AI large language models helps them better align with human brain activity

Active label distribution learning via kernel maximum mean discrepancy

Exploring the interactions between explicit and implicit motor learning components

A quantum neural network can see optical illusions like humans do. Could it be the future of AI?

Exploring the fundamental reasoning abilities of LLMs

Research team proposes solution to AI's continual learning problem

Virtual and augmented reality can temporarily change the way people perceive distances, finds study

Morphing facial technology sheds light on the boundaries of self-recognition

Transparency is often lacking in datasets used to train large language models, study finds

Phys.org

Medical Xpress

Science X

A new and simple approach extending data expansion-based implicit discourse relation classification

A quantum neural network can see optical illusions like humans do. Could it be the future of AI?

Pilot plant demonstrates iron-based hydrogen storage feasibility

Exploring the fundamental reasoning abilities of LLMs

Research team proposes solution to AI's continual learning problem

Virtual and augmented reality can temporarily change the way people perceive distances, finds study

Researchers develop ultra-high efficiency perovskite LEDs by strengthening lattice

Transparency is often lacking in datasets used to train large language models, study finds

Morphing facial technology sheds light on the boundaries of self-recognition

Silicon chip propels 6G communications forward

Scalable graphene technology could significantly enhance battery safety and performance

Related Stories

Adaptive-k: A simple and effective method for robust training in label noisy datasets

Attribute augmentation-based label integration for crowdsourcing

Health care professionals exhibit gender bias

Improving AI large language models helps them better align with human brain activity

Active label distribution learning via kernel maximum mean discrepancy

Exploring the interactions between explicit and implicit motor learning components

Recommended for you

A quantum neural network can see optical illusions like humans do. Could it be the future of AI?

Exploring the fundamental reasoning abilities of LLMs

Research team proposes solution to AI's continual learning problem

Virtual and augmented reality can temporarily change the way people perceive distances, finds study

Morphing facial technology sheds light on the boundaries of self-recognition

Transparency is often lacking in datasets used to train large language models, study finds

Your Privacy