Distilled and fused style embedding for authorship attribution with weighted multi-kernel ensemble model
2024, vol.16 , no.4, pp. 25-36
Article [2024-04-03]
In this age of Artificial Intelligence, the dispute over ownership of the web contents are increasing. In forensic studies and plagiarism cases, determining the author of a document undoubtedly plays a crucial role. The task of Authorship Attribution (AA) involves learning the style of the writers based on available documents to predict the ownership of unknown documents. Hence, capturing the style of writing is primary besides challenging. In this work, key features are systematically extracted, multiple deep-learning models are trained on various aspects of the text, and two novelties: Distilled and Fused Style Embedding (DFSE) and a multi-kernel ensemble model with kernels weighted based on Z-scores are proposed. The performance of the proposed model is superior to the baseline model and many contemporary models for in-genre and multi-topic datasets.
authorship classification, natural language processing, style embedding, neural network, ensemble model
https://doi.org/10.59035/XDXI5594
B. Lavanya, R. Sowmiya. Distilled and fused style embedding for authorship attribution with weighted multi-kernel ensemble model . International Journal on Information Technologies and Security, vol.16 , no.4, 2024, pp. 25-36. https://doi.org/10.59035/XDXI5594