
Original scientific paper

https://doi.org/10.20532/cit.2023.1005673

Efficient Sentence Representation Learning via Knowledge Distillation with Maximum Coding Rate Reduction

Domagoj Ševerdija ; School of Applied Mathematics and Computer Science, University of Osijek, Croatia
Tomislav Prusina orcid id orcid.org/0009-0000-5331-4183 ; Universität Hamburg, Department of Informatics, Germany
Luka Borozan ; School of Applied Mathematics and Computer Science, University of Osijek, Croatia
Domagoj Matijević orcid id orcid.org/0000-0003-3390-9467 ; School of Applied Mathematics and Computer Science, University of Osijek, Croatia


Full text: English, PDF, 692 KB


pp. 251-266



Abstract

Addressing the demand for effective sentence representations in natural language inference, this paper explores the utility of pre-trained large language models for computing such representations. Although these models generate high-dimensional sentence embeddings, a noticeable performance gap arises when they are compared to smaller models. Hardware constraints on memory and inference time necessitate smaller, distilled versions of large language models. In this study, we investigate knowledge distillation of Sentence-BERT, a sentence representation model, by introducing an additional projection layer trained with the novel Maximum Coding Rate Reduction (MCR2) objective, which was designed for general-purpose manifold clustering. Our experiments demonstrate that the distilled language model, with reduced complexity and a smaller sentence embedding size, achieves comparable results on semantic retrieval benchmarks, providing a promising solution for practical applications.
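
The abstract refers to the MCR2 objective without stating it; as a rough illustration only, the following minimal PyTorch sketch shows the coding-rate-reduction loss in its standard form (expansion of the whole batch minus compression of per-class subsets). This is not the authors' implementation: the function names, the eps parameter, the hard-label assumption, and the use of PyTorch are assumptions made for the example.

    import torch

    def coding_rate(Z: torch.Tensor, eps: float = 0.5) -> torch.Tensor:
        # R(Z, eps) = 1/2 * logdet(I + d / (m * eps^2) * Z^T Z)
        # for a batch Z of shape (m, d) holding m embeddings of dimension d.
        m, d = Z.shape
        I = torch.eye(d, device=Z.device, dtype=Z.dtype)
        return 0.5 * torch.logdet(I + (d / (m * eps ** 2)) * Z.T @ Z)

    def mcr2_loss(Z: torch.Tensor, labels: torch.Tensor, eps: float = 0.5) -> torch.Tensor:
        # Negative coding rate reduction: -(R(Z) - R_c(Z | labels)), where
        # R_c sums m_j / (2m) * logdet(I + d / (m_j * eps^2) * Z_j^T Z_j)
        # over the per-class subsets Z_j. Minimizing this loss maximizes the reduction.
        m, d = Z.shape
        expansion = coding_rate(Z, eps)
        compression = Z.new_zeros(())
        for c in labels.unique():
            Zc = Z[labels == c]
            mc = Zc.shape[0]
            I = torch.eye(d, device=Z.device, dtype=Z.dtype)
            compression = compression + (mc / (2 * m)) * torch.logdet(
                I + (d / (mc * eps ** 2)) * Zc.T @ Zc
            )
        return -(expansion - compression)

In the setting described in the abstract, Z would plausibly be the output of the added projection layer on top of the distilled Sentence-BERT encoder, with labels coming from whatever grouping signal is available during distillation; how the loss is combined with the distillation objective is not specified here and is left open in this sketch.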

Keywords

Sentence embeddings; knowledge distillation; Maximum Coding Rate Reduction; semantic retrieval

Hrčak ID:

317643

URI

https://hrcak.srce.hr/317643

Publication date:

28 May 2024
