Show/Hide Menu
Hide/Show Apps
Logout
Türkçe
Türkçe
Search
Search
Login
Login
OpenMETU
OpenMETU
About
About
Open Science Policy
Open Science Policy
Open Access Guideline
Open Access Guideline
Postgraduate Thesis Guideline
Postgraduate Thesis Guideline
Communities & Collections
Communities & Collections
Help
Help
Frequently Asked Questions
Frequently Asked Questions
Guides
Guides
Thesis submission
Thesis submission
MS without thesis term project submission
MS without thesis term project submission
Publication submission with DOI
Publication submission with DOI
Publication submission
Publication submission
Supporting Information
Supporting Information
General Information
General Information
Copyright, Embargo and License
Copyright, Embargo and License
Contact us
Contact us
TinyRS-R1: Compact Vision Language Model for Remote Sensing
Date
2025-01-01
Author
Köksal, Aybora
Alatan, Abdullah Aydın
Metadata
Show full item record
This work is licensed under a
Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License
.
Item Usage Stats
29
views
0
downloads
Cite This
Remote sensing applications often rely on edge hardware that cannot host the models in the 7B parametric vision language of today. This paper presents TinyRS, the first 2B-parameter VLM optimized for remote sensing, and TinyRS-R1, its reasoning-augmented variant. Based on Qwen2-VL-2B, TinyRS is trained via a four-stage pipeline: pre-training on million-scale satellite images, instruction tuning, fine-tuning with Chain-of-Thought (CoT) annotations from a new reasoning dataset, and GRPO-based alignment. TinyRS-R1 matches or surpasses recent 7B remote sensing models in classification, VQA, grounding, and open-ended QA–while using one third of the memory and latency. CoT reasoning improves grounding and scene understanding, while TinyRS excels at concise, low-latency VQA. TinyRS-R1 is the first domain-specialized small VLM with GRPO-aligned CoT reasoning for general-purpose remote sensing. The code, models, and caption datasets will be released.
Subject Keywords
aerial image analysis
,
chain-of-thought reasoning
,
domain adaptation
,
group relative policy optimization
,
remote sensing
,
Vision language models
URI
https://hdl.handle.net/11511/117262
Journal
IEEE Geoscience and Remote Sensing Letters
DOI
https://doi.org/10.1109/lgrs.2025.3623244
Collections
Department of Electrical and Electronics Engineering, Article
Citation Formats
IEEE
ACM
APA
CHICAGO
MLA
BibTeX
A. Köksal and A. A. Alatan, “TinyRS-R1: Compact Vision Language Model for Remote Sensing,”
IEEE Geoscience and Remote Sensing Letters
, pp. 0–0, 2025, Accessed: 00, 2025. [Online]. Available: https://hdl.handle.net/11511/117262.