Abstract: Remote sensing image retrieval with text feedback (RSIR-TF) is a challenging multimodal retrieval task that leverages a reference image, a modification text, and a scene graph to retrieve ...
OpenOCR is an open-source toolkit developed by the OCR team at the FVL Lab, Fudan University, under the guidance of Prof. Yu-Gang Jiang and Prof. Zhineng Chen. It focuses on "General-OCR" tasks, ...