CoVirNet: a multimodal web-based book genre classification system using a dual-color-space input CNN-ViT architecture

Book genres are not always clearly defined, and classifying them based solely on visual or textual patterns can be unreliable. While recent models attempt to improve genre classification by combining both cues, key limitations remain in how input representations and model architectures are designed....

Bibliographic details
Main author: Acompañado, Emiline Barcent Jloise S. (Author)
Other authors: Yusiong, John Paul T. (adviser)
Format: Thesis
Language: English
Subjects: