Abstract: Recent research in remote sensing object detection (RSOD) has significantly advanced the development of vision foundation models. However, deploying these models on resource-constrained edge ...
Abstract: The diversity of VQA questions brings new challenges for VQA models in predicting the answer. Existing models focus on the construction of new attention mechanisms and object recognition, but ...