CoVirNet: a multimodal web-based book genre classification system using a dual-color-space input CNN-ViT architecture
Book genres are not always clearly defined, and classifying them based solely on visual or textual patterns can be unreliable. While recent models attempt to improve genre classification by combining both cues, key limitations remain in how input representations and model architectures are designed....
| Main author: | |
|---|---|
| Other authors: | |
| Format: | Thesis |
| Language: | English |
| Subjects: |