Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
cpursley
4 months ago
|
parent
|
context
|
favorite
| on:
Qwen3-Next
How are you prepping the PDF data before shoving it into Qwen?
Alifatisk
4 months ago
|
next
[–]
I just compress the file size as low as possible without losing the quality, didn't even know there was more ways to prep it.
I do sometimes chop up the PDF into smaller pdfs with their own individual chapters
amelius
4 months ago
|
parent
|
next
[–]
On Linux you can use pdftotext also if you are only concerned with the text.
navbaker
4 months ago
|
prev
[–]
Not OP, but we use the docling library to extract text and put it in markdown before storing for use with an LLM.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: