Triple Alliance Prototype Orthotist Network for Long-Tailed Multi-Label Text Classification

Lin Xiao, Pengyu Xu, Mingyang Song, Huafeng Liu, Liping Jing, Xiangliang Zhang

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Multi-label text classification (MLTC) aims to tag the most relevant labels for the given document. Compared to the standard multi-class case where each document has only one label, it is considerably more difficulty to annotate new coming documents for multi-label text classification. Furthermore, it also suffers from the challenge of highly skewed long-tailed label distribution. Due to the relative infrequency of tail labels, this leads to an imbalance that biases towards predicting more head labels. To address the challenge, we propose a Triple Alliance Prototype Orthotist Network (TAPON) to build a generic meta-mapping from few-shot prototypes to many-shot classifier parameters, which aims to promote the generalizability of tail classifiers. To be specific, TAPON is a two-stage method. At the first stage, TAPON obtains the meta-knowledge between many-shot classifier parameters and few-shot prototype of head labels. Meanwhile, the triple alliance prototype is obtained by adopting an Attentive Prototype with the aid of few-shot documents, label semantic information and label correlation. Additionally, a Prototype Orthotist module is especially designed to capture the meta-knowledge between the many-shot classifier and few-shot prototype. At the second stage of transferring, TAPON aims to transfer the generic meta-mapping from head labels to tail labels. It first uses Attentive Prototype to obtain triple alliance prototype for tail labels, and then uses the meta-knowledge obtained from the first stage to get many-shot classifiers for tail labels. By conducting extensive experiments on benchmark datasets, we show that the proposed TAPON significantly outperforms other state-of-the-art methods for long-tailed multi-label text classification.
Original languageEnglish (US)
Pages (from-to)2616-2628
Number of pages13
JournalIEEE/ACM Transactions on Audio Speech and Language Processing
Volume31
DOIs
StatePublished - Jan 1 2023
Externally publishedYes

Bibliographical note

Generated from Scopus record by KAUST IRTS on 2023-09-20

ASJC Scopus subject areas

  • Media Technology
  • Instrumentation
  • Acoustics and Ultrasonics
  • Linguistics and Language
  • Signal Processing
  • Electrical and Electronic Engineering
  • Speech and Hearing

Fingerprint

Dive into the research topics of 'Triple Alliance Prototype Orthotist Network for Long-Tailed Multi-Label Text Classification'. Together they form a unique fingerprint.

Cite this