Reducto has trained multiple models on the various visual cues that matter in documents - from spaces between paragraphs that denote a change in topic to tabs in lists that show nested hierarchies.
We just finished building what we believe is state of the art for spreadsheet parsing. Our goal across all of this is to be the layer that connects human data with LLMs.
Collection
[
|
...
]