Performance Evaluation of Caps-Net Based Multitask Learning Architecture for Text Classification
PDF

Keywords

Text Classification
Caps-Net Architecture
Multi-Task Learning
CNN
Task Clustering

How to Cite

Jacob, I. Jeena. 2020. “Performance Evaluation of Caps-Net Based Multitask Learning Architecture for Text Classification”. Journal of Artificial Intelligence and Capsule Networks 2 (1): 1-10. https://doi.org/10.36548/jaicn.2020.1.001.

Abstract

The classification of the text involving the process of identification and categorization of text is a tedious and a challenging task too. The Capsules Network (Caps-Net) which is a unique architecture with the capability to confiscate the basic attributes comprising the insights of the particular field that could help in bridging the knowledge gap existing between the source and the destination tasks and capability learn more robust representation than the CNN-Convolutional neural networks in the image classification domain is utilized in the paper to classify the text. As the multi-task learning capability enables to part insights between the tasks that are related and enhances data used in training indirectly, the Caps-Net based multi task learning frame work is proposed in the paper. The proposed architecture including the Caps-Net effectively classifies the text and minimizes the interference experienced among the multiple tasks in the multi-task learning. The architecture put forward is evaluated using various text classification dataset ensuring the efficacy of the proffered frame work

PDF

References

Joachims, Thorsten. "Text categorization with support vector machines: Learning with many relevant features." In European conference on machine learning, pp. 137-142. Springer, Berlin, Heidelberg, 1998.

Blei, David M., Andrew Y. Ng, and Michael I. Jordan. "Latent dirichlet allocation." Journal of machine Learning research 3, no. Jan (2003): 993-1022.

Collobert, Ronan, and Jason Weston. "A unified architecture for natural language processing: Deep neural networks with multitask learning." In Proceedings of the 25th international conference on Machine learning, pp. 160-167. 2008.

Liu, Pengfei, Xipeng Qiu, and Xuanjing Huang. "Recurrent neural network for text classification with multi-task learning." arXiv preprint arXiv:1605.05101 (2016).

Zhang, Honglun, Liqiang Xiao, Yongkun Wang, and Yaohui Jin. "A generalized recurrent neural architecture for text classification with multi-task learning." arXiv preprint arXiv:1707.02892 (2017).

Hinton, Geoffrey E., Alex Krizhevsky, and Sida D. Wang. "Transforming auto-encoders." In International conference on artificial neural networks, pp. 44-51. Springer, Berlin, Heidelberg, 2011.

Vijayakumar, T. "comparative study of capsule neural network in various applications." Journal of Artificial Intelligence 1, no. 01 (2019): 19-27.

Bashar, Abul. "survey on evolving deep learning neural network architectures." Journal of Artificial Intelligence 1, no. 02 (2019): 73-82.

Manoharan, Samuel. "a smart image processing algorithm for text recognition, information extraction and vocalization for the visually challenged." Journal of Innovative Image Processing (JIIP) 1, no. 01 (2019): 31-38.

Mousa, Amr, and Björn Schuller. "Contextual bidirectional long short-term memory recurrent neural network language models: A generative approach to sentiment analysis." In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 1, Long Papers, pp. 1023-1032. 2017.

Hinton, Geoffrey E., Sara Sabour, and Nicholas Frosst. "Matrix capsules with EM routing." (2018).

Fang, Yuchun, Zhengyan Ma, Zhaoxiang Zhang, Xu-Yao Zhang, and Xiang Bai. "Dynamic Multi-Task Learning with Convolutional Neural Network." In IJCAI, pp. 1668-1674. 2017.

Sabour, Sara, Nicholas Frosst, and Geoffrey E. Hinton. "Dynamic routing between capsules." In Advances in neural information processing systems, pp. 3856-3866. 2017.

Chung, Junyoung, Caglar Gulcehre, KyungHyun Cho, and Yoshua Bengio. "Empirical evaluation of gated recurrent neural networks on sequence modeling." arXiv preprint arXiv:1412.3555 (2014).

Jacob, I. Jeena. "Capsule network based biometric recognition system." Journal of Artificial Intelligence 1, no. 02 (2019): 83-94.