Leveraging label hierarchy using transfer and multi-task learning: A case study on patent classification

Research Output

When labels are organized into a meaningful taxonomy, the parent-child relationship between labels at different levels can give the classifier additional information not deducible from the data alone, especially with limited training data. As a case study, we illustrate this effect on the task of patent classification—the task of categorizing patent documents based on their technical content. Existing approaches do not take into consideration this additional information. Experiments on two patent classification datasets, WIPO-alpha and USPTO-2M, show that our regularized Gated Recurrent Unit (GRU) architecture already gives a performance improvement with a micro-averaged precision score using the top prediction of 0.5191 and 0.5740 on the two datasets, respectively. However, knowledge transfer along the label hierarchy gives further significant improvement on WIPO-alpha, raising the score to 0.5376, and a small improvement on USPTO-2M to 0.5743. Our analyses reveal that incorporating label information improves performance on classes with fewer examples and makes model robust to errors that result from predicting closely related labels.

Type:

Article
Date:

30 July 2021
Publication Status:

Published
Publisher

Elsevier BV
DOI:

10.1016/j.neucom.2021.07.057
Cross Ref:

10.1016/j.neucom.2021.07.057
ISSN:

0925-2312

http://researchrepository.napier.ac.uk/output/2811581 Aroyehun, S. T., Angel, J., Majumder, N., Gelbukh, A., & Hussain, A. (2021). Leveraging label hierarchy using transfer and multi-task learning: A case study on patent classification. Neurocomputing, 464, 421-431. https://doi.org/10.1016/j.neucom.2021.07.057

Citation

Aroyehun, S. T., Angel, J., Majumder, N., Gelbukh, A., & Hussain, A. (2021). Leveraging label hierarchy using transfer and multi-task learning: A case study on patent classification. Neurocomputing, 464, 421-431. https://doi.org/10.1016/j.neucom.2021.07.057

Authors

Prof Amir Hussain

Professor
School of Computing Engineering and the Built Environment

0131 455 2239

A.Hussain@napier.ac.uk

Keywords

Transfer learning, Multi-task learning, Patent classification, Natural language processing, Neural networks, Machine learning

Monthly Views:

Available Documents

Files currently unavailable for download , please contact repository@napier.ac.uk to request a copy
Downloadable citations
HTML BIB RTF

Type:

Date:

Publication Status:

Publisher

DOI:

Cross Ref:

ISSN:

Citation

Authors

Prof Amir Hussain

Keywords

Monthly Views:

Files currently unavailable for download , please contact repository@napier.ac.uk to request a copy

Downloadable citations