PS to Text Parser

Closed Posted Mar 25, 2015 Paid on delivery

Hi There!

I regularly run into the following issue. I need to extract text from a PDF (to convert it to EPUB). Certain PDFs contain hidden or unwanted spaces: a word looks fine in the PDF, but the extracted text shows a space between its letters. Something similar happens with capitals: the PDF shows a capital letter, but the extracted text does not.

Somebody told me that a way around this would be to convert the PDF to PS and then extract the text from the PS.

So I think I need a parser that can extract text (ASCII / UTF-8) from a PS file. If this works, we would like to install it on a server and run batch conversions.

Thanks,

Bas
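
For illustration only, the batch step requested above could be sketched roughly as follows. This assumes Ghostscript is installed and reachable as `gs`, and uses its `txtwrite` output device to dump the text of each PS file. The function names `build_gs_command` and `convert_folder` are hypothetical, and whether this route actually removes the stray spaces or restores the capitals depends on how the fonts are encoded in the original file, so treat it as something to try rather than a confirmed fix.

```python
import subprocess
from pathlib import Path


def build_gs_command(ps_path, txt_path, gs_binary="gs"):
    """Build a Ghostscript command line that extracts text from one PS file.

    Hypothetical helper: `gs_binary` is an assumption about how Ghostscript
    is named on the server.
    """
    return [
        gs_binary,
        "-q",                         # suppress the startup banner
        "-dNOPAUSE",                  # do not pause between pages
        "-dBATCH",                    # exit once the input is processed
        "-sDEVICE=txtwrite",          # text-extraction output device
        f"-sOutputFile={txt_path}",   # where the extracted text goes
        str(ps_path),
    ]


def convert_folder(in_dir, out_dir, gs_binary="gs", run=subprocess.run):
    """Convert every *.ps file in in_dir to a .txt file in out_dir.

    `run` is injectable so the loop can be exercised without Ghostscript.
    Returns the list of output paths, in sorted input order.
    """
    in_dir, out_dir = Path(in_dir), Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for ps_path in sorted(in_dir.glob("*.ps")):
        txt_path = out_dir / (ps_path.stem + ".txt")
        # check=True raises CalledProcessError if Ghostscript fails
        run(build_gs_command(ps_path, txt_path, gs_binary), check=True)
        written.append(txt_path)
    return written
```

On a server this might be called as `convert_folder("incoming", "extracted")` from a scheduled job; the folder names here are placeholders.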

Linear Programming PDF

Project ID: #7369852

About the project

5 proposals Remote project Active May 1, 2015

5 freelancers are bidding on average €326 for this job

SoftwareEng8876

Hello sir, I am an expert freelancer for PDF work. You can check the feedback on my profile. Would you like to start a chat with me? I can show you much of my previous PDF work. Hopefully you will be satisfied with it.

€263 EUR in 4 days
(17 Reviews)
4.2
adrif73

Hello! I would like to offer you my services for your project. I have worked on similar projects, as you can see in my profile page, and I am an expert in ebooks. Kind regards!

€250 EUR in 2 days
(8 Reviews)
2.3
vvadimov

A proposal has not yet been provided

€277 EUR in 12 days
(1 Review)
1.8
eamora2014

Hello, I can try to extract the text from your PDF with a group of software applications that ensure high quality and accuracy. Send me a short sample file, I'll extract the text, so you can see if my work fits y…

€300 EUR in 3 days
(0 Reviews)
0.0
discintegrator

A proposal has not yet been provided

€250 EUR in 10 days
(0 Reviews)
0.0
rootica

A proposal has not yet been provided

€555 EUR in 3 days
(0 Reviews)
0.0