A dispute over a Kansas hamburger stand's mural could go all the way to the U.S. Supreme Court. The question: Is it art or advertising? The outcome could affect hundreds of municipalities.
Abstract: Cross-modal image-text retrieval enables efficient heterogeneous modality interaction via vision-language semantic alignment, advancing multimodal intelligence applications. However, ...