Synthetic Multimodal-Datasets Generation
-
The MERIT Dataset: Modelling and Efficiently Rendering Interpretable Transcripts
Paper • 2409.00447 • Published • 3 -
VERSE: Visual Embedding Reduction and Space Exploration. Clustering-Guided Insights for Training Data Enhancement in Visually-Rich Document Understanding
Paper • 2601.05125 • Published