CoVirNet: a multimodal web-based book genre classification system using a dual-color-space input CNN-ViT architecture

Book genres are not always clearly defined, and classifying them based solely on visual or textual patterns can be unreliable. While recent models attempt to improve genre classification by combining both cues, key limitations remain in how input representations and model architectures are designed....

Bibliographic Details
Main Author: Acompañado, Emiline Barcent Jloise S. (author)
Other Authors: Yusiong, John Paul T. (adviser)
Format: Thesis
Language: English
Subjects: