Search Results

Found 1 repositories(showing 1)

Cosmos-x-DocScope

PRITHIVSAKTHIUR

❤️40

Understand physical common sense and generate appropriate embodied decisions. optimized for document-level optical character recognition, long-context vision-language understanding. build with hand-curated dataset for text-to-image models, providing significantly more detailed descriptions or captions of given images.

Apache-2.0

Python

Updated 10 months ago

huggingface-transformersocrqwen2-5-vl+3

All 1 repositories loaded

GitHub Explorer

Search Results

Cosmos-x-DocScope

Cosmos-x-DocScope