LiteParse pairs fast text parsing with a two-stage agent pattern, falling back to multimodal models when tables or charts ...
Abstract: Visually-situated text parsing (VsTP) has recently seen notable advancements, driven by the growing demand for automated document understanding and the emergence of large language models ...