Meta AI’s OMNIVORE: A Modality-Agnostic Single Vision Model With Cross-Modal Generalization
A Meta AI research team presents OMNIVORE, a single vision model for various visual modalities that can perform cross-modal generalization and achieves performance at par or better than traditional modality-specific models of the same size.