Find Jobs
Hire Freelancers

Write a script to convert formatted PDF exams to XML/HTML files.

$30-250 USD

In Progress
Posted almost 9 years ago

$30-250 USD

Paid on delivery
I have hundreds of PDF exam files which are all in the same format, I want all the exams formatted into a database, but for this I need them to be something I can parse easily like XML/HTML. The info I need from each exam is: Exam name For each question: 1. Question number (and if the exam is divided to topics, which topic it belongs to) 2. Question Text (the actual question. 3. If the question has multiple choices the text of each choice. (the question title specifies if it is a multiple choice question or not). 4. Question answer. 5. Question Answer Explanation. The hard part is that fields 2-5 might contain images in them, if there is an image, it should be extracted to a file, and referenced to from the correct place. I don't care if the script/program that you'll supply will handle one exam at a time and I'll create a script the runs it on on all the files. Attaching a sample exam file, there is an image in question #4, I'll supply later 2 more exams that will basically cover all the possible cases of how an exam should look like.
Project ID: 8182201

About the project

17 proposals
Remote project
Active 9 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
Awarded to:
User Avatar
Hi, My name is Gilad (originally from Israel, by the way), and I specialize in creating custom-made tools for PDF files. I had a look at your file and read the instructions and I believe I can develop for you either a script or a stand-alone tool (or a combination of both) that will do what you described. The most tricky part is extracting the images, as you said, but I already have a tool written in Java that can do that, I just need to find a way to attach each image to its matching question. At any rate, I'm available to start working on it right away. A little bit about me: I'm an Expert on both the Adobe and AcrobatAnswers forums and have a website dedicated to my custom-made tools for PDF files that you're welcome to check out (Google my handle-name to find it). You're also welcome to check out my work history on this site and see some of the PDF-related projects I've worked on in the past. Regards, Gilad (try67)
$350 USD in 5 days
4.9 (107 reviews)
6.6
6.6
User Avatar
Hello. I have read your project description, and I would enjoy creating the program for you. Converting from one format to another is generally pretty easy, but as you noted, these may contain images. The way I would solve this is by having a first-pass which extracts the images to a file with references, then processing the remaining text from pdf to html. Please contact me so that we may speak further, and I hope to work with you soon.
$277 USD in 3 days
4.9 (58 reviews)
6.4
6.4
17 freelancers are bidding on average $246 USD for this job
User Avatar
Hello, I'm a novice freelancer with great experience in the development, I want to make the most quickly and efficiently. Send a more detailed this job! Any question welcome! Best regards, Vasiliy
$150 USD in 5 days
4.9 (37 reviews)
6.3
6.3
User Avatar
I have experience of writing scripts using python to my clients life easier . I wrote so many scripts to extract information from different resources . I can provide you with the output of all the files according to your requirement , Discuss further details on skype. My skypeID is : noumanawais Looking forward to hear you soon . thanks.
$150 USD in 5 days
5.0 (43 reviews)
5.6
5.6
User Avatar
Hello, I have a experience with PDF processing. It's interesting task for me. All the files have same structure?
$222 USD in 10 days
5.0 (12 reviews)
5.3
5.3
User Avatar
Hi, I am confident to deliver more than your expectation, if given a chance. Kindly have a look into my profile, and if it interests you, lets discuss more of the project. I plan to do it preferably in PHP, else PYTHON. You will have to provide me at least 3 pdf's(which you already mentioned) to confirm the type of documents are similar. Thanks and Cheers.
$249 USD in 15 days
4.9 (9 reviews)
5.0
5.0
User Avatar
Hello. More 20 years programming experience. I need more details to set real time and price. Regards. ---------------------------------------------------------------------------------------------------------------------------------------------------
$250 USD in 3 days
4.3 (19 reviews)
4.9
4.9
User Avatar
Hi, I have 4 years of Python coding experience. I saw that pdfminer does a good job on extracting the text and images for your project. I can have this script ready and tested locally in 2-3 days, and for the rest of the time you can investigate and let me know if there are cases which are not handled. Thanks, Bogdan
$555 USD in 7 days
5.0 (6 reviews)
4.0
4.0
User Avatar
Bir öneri henüz sağlanmadı
$155 USD in 1 day
5.0 (5 reviews)
3.2
3.2
User Avatar
Hi, I have a lot of experience with python and I'm sure I can finish your project by Monday. We can talk more details on private!
$277 USD in 3 days
5.0 (1 review)
1.5
1.5
User Avatar
A proposal has not yet been provided
$155 USD in 3 days
0.0 (0 reviews)
0.0
0.0
User Avatar
I have extensive experience in processing large amounts of data (databases with 100 of millions of rows) and changing it into a different format (data on a website into a searchable database, a searchable database into various CSV files, HTML to PDF, XML to PDF, etc.
$155 USD in 7 days
0.0 (0 reviews)
0.0
0.0
User Avatar
一个有效的提议尚未被提供
$155 USD in 3 days
0.0 (0 reviews)
0.0
0.0
User Avatar
A proposal has not yet been provided
$111 USD in 5 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Hello I have a good script idea to read in a folder and get all exam files in the folder. it will then create a new folder with the new xml files along with a picture folder to reference which picture came from witch exam.
$222 USD in 15 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of ISRAEL
Tel-Aviv, Israel
5.0
5
Payment method verified
Member since Jul 31, 2015

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.