CoVirNet: a multimodal web-based book genre classification system using a dual-color-space input CNN-ViT architecture

Book genres are not always clearly defined, and classifying them based solely on visual or textual patterns can be unreliable. While recent models attempt to improve genre classification by combining both cues, key limitations remain in how input representations and model architectures are designed....

Bibliographic Details
Main Author: Acompañado, Emiline Barcent Jloise S. (Author)
Other Authors: Yusiong, John Paul T. (adviser)
Format: Thesis
Language: English
Subjects: