CoVirNet a multimodal web-based book genre classification system using a dual-color-space input CNN-ViT architecture

Book genres are not always clearly defined and classifying them based solely on visual or textual patterns can be unreliable. While recent models attempt to improve genre classification by combining both cues, key limitations remain in how input representations and model architectures are designed....

Descrizione completa

Dettagli Bibliografici
Autore principale: Acompañado, Emiline Barcent Jloise S. (Autore)
Altri autori: Yusiong, John Paul T. (adviser.)
Natura: Tesi
Lingua:English
Soggetti: