    119 pdfbox jobs found, pricing in GBP

    I am looking for a skilled mobile app developer who can create a native app for both Android and iOS platforms. The project requires the following: Features: - Database: PostgreSQL or MySQL or MongoDB - Make PDF from database: Apache PDFBox or iText - Email service: SendGrid or Mailgun - Storage of PDF: Apple (iOS): Secure Enclave API and Google (Android): Keystore System - Payment: Stripe - E-signature: service's API to initiate sending a one-time password (OTP) via SMS. DocuSign or Adobe Sign or SignNow Design: Design already done on Figma English version : (Copy)?type=design&node-id=0%3A1&mode=design&t=fWuFnb5PnhqXNVYT-1 French version :

    £1185 (Avg Bid)
    85 bids

    ...in Java Swing with Apache PDFBox. The project is already underway and requires the integration of additional functionalities: 1. Crop Tool with Zoom functionality: Implement a tool to crop a small portion of a PDF which represents a location on a larger map. The crop will act as a canvas (so the image must be a high-quality zoomed-in image of the selected area), where objects will be added to this portion of the PDF. 2. Select Tool: The cropped location will contain abstract objects which represent buildings. The select tool will enable any existing object to be selected and moved to a new location on the displayed area. 3. Measure Distances Tool: Create a tool for measuring distances between 2 points on the PDF. Skills Required: - Experience with Apache PDFBox is ...

    £6 / hr (Avg Bid)
    17 bids

    ...dimensions. 2. Select Tool: Develop a feature allowing users to select specific areas of a PDF document. 3. Measure Distances Tool: Create a tool for measuring distances between elements within the PDF. We utilize Apache PDFBox for PDF processing; therefore, familiarity with this library is preferred. Experience in developing similar functionalities will be highly advantageous. Successful implementation of these features may lead to further opportunities for collaboration on this project. Skills Required: - Proficiency in Java development - Experience with Apache PDFBox or similar PDF manipulation libraries - Strong understanding of PDF file structure and manipulation techniques Interested freelancers are invited to submit their proposals, highlighting relevant experie...

    £3 / hr (Avg Bid)
    14 bids

    I'...specific, the task is to move the last page of the PDF to the front. This should be achieved seamlessly without disrupting the quality or the content of the PDF. Ideal Skills: - Extensive experience in programming, particularly dealing with PDF manipulation. - Proficient in a language that can perform such a task, for instance Python with relevant libraries (PyPDF2, pdfrw) or Java (with iText, PDFBox). - Strong problem-solving skills to handle any unexpected issues. Required Deliverable: - A working script that takes a PDF as input and outputs the PDF with the last page at the front. Project Deadline: - The script is expected to be fully operational and delivered within a month from the project commencement date. Looking forward to your proposals including methodology...

    £100 (Avg Bid)
    32 bids

    Please employ the pdfbox 3.0.1 library and Java language (Java 8, JDK 17) to substitute text within a PDF file that already contains text (refer to the attached files). I am in need of the Java source code for a program capable of accomplishing this task, as per the specifications detailed in the attachment files. The initial PDF file is denoted as "Payslip_template.pdf." After executing the program with this template file, the resulting document should be named "," wherein the text replacements have been applied. I kindly request the completion of this task at your earliest convenience. Additionally, please note that the file "" is intended for conversion to the "" file. Moreover, I specifically require a console-based Java project

    £33 (Avg Bid)
    12 bids

    ...editing PDF files using various tools and software, including but not limited to desktop automation wrappers around well-known PDF editors such as PDF Expert and Adobe Acrobat. Candidates are not limited to Python-based solutions such as Python with PyPDF2 or PyMuPDF, and can use any method or programming language they are proficient with (including Adobe Acrobat Pro with JavaScript, Java with Apache PDFBox, JavaScript with HummusJS, and Desktop Automation Tools like AutoHotkey or AutoIt), as long as it allows for precise manipulation and replacement of text and formatting in PDF documents. The tasks for this project include: Text Replacement in Sample PDFs: Edit two sample PDFs (, ) by replacing specific text strings. The edited PDFs must maintain the original layout,

    £488 (Avg Bid)
    52 bids

    ...editing PDF files using various tools and software, including but not limited to desktop automation wrappers around well-known PDF editors such as PDF Expert and Adobe Acrobat. Candidates are not limited to Python-based solutions such as Python with PyPDF2 or PyMuPDF, and can use any method or programming language they are proficient with (including Adobe Acrobat Pro with JavaScript, Java with Apache PDFBox, JavaScript with or HummusJS, and Desktop Automation Tools like AutoHotkey or AutoIt), as long as it allows for precise manipulation and replacement of text and formatting in PDF documents. The tasks for this project include: Text Replacement in Sample PDFs: Edit two sample PDFs (, ) by replacing specific text strings. The edited PDFs must maintain the original

    £37 (Avg Bid)
    11 bids

    I am looking for a freelancer who can assist me with PDF validation in my project. The main issue I am encountering is PDF validation inconsistency. Programming Language: Java Ideal Skills and Experience: - Strong knowledge and experience in Java programming language - Expertise in PDF manipulation using PDFBox library - Familiarity with PDF data extraction and validation techniques - Experience in working with test scripts and identifying and resolving issues related to PDF validation - Excellent problem-solving skills and ability to understand complex issues related to PDF validation Project Details: - The project involves validating PDFs from different user interfaces with PDF data. - I have covered 13 test cases in 3 test scripts, but there are variations in 7-8 PDF details ...

    £17 / hr (Avg Bid)
    4 bids

    Senior Developer highly skilled in the following technologies: Html/CSS, ReactJS, Java, JasperReports, iText/PDFBox, Spring (MVC, Data, Security...), Thymeleaf, JSF, MySQL/PostgreSQL

    £913 (Avg Bid)
    103 bids

    I need a class jar that extracts the text for text fields and comboboxes from a one-page interactive PDF. Can use PDFBox or any other free download Java PDF API (SpirePDF, Qoppa JavaPDF, and so on). The extracted fields need to be posted to a Derby db for later manipulation (Derby db file will be included). The PDF is an "exam" where the user selects the answer from the combo box pulldown selections. I will provide current base program jars (using Netbeans 17 and Zulu OpenJDK 17) and an empty class file with the needed variables and imports. Must keep the programming simple with sufficient comments explaining the main methods used.

    £130 (Avg Bid)
    17 bids

    ...objective of this project is to read PDF files from a specified location, extract data row and column wise, and store the data in a SQL Server table row and column wise. The data can then be accessed using a WebAPI. Technologies: C# (itext7,itextsharp,,pdfsharp,pdfpig, Tesseract, pdfsharp,,ocropus) SQL Server WebAPI Python (Tabula-py, Camelot, pdfplumber,) Java (Apache pdfbox) Requirements: Visual Studio 2019 or higher SQL Server Management NuGet package Select a file from a particular location, then perform the steps below: 1. First check for a password on the PDF 2. Remove it 3. Then check whether it is an image or not 4. If it is an image, use OCR 5. If not, extract data row and column wise 6. Push into the SQL table as per the PDF format Extra functions required 7. Edit PDF function

    £4 / hr (Avg Bid)
    16 bids

    ...objective of this project is to read PDF files from a specified location, extract data row and column wise, and store the data in a SQL Server table row and column wise. The data can then be accessed using a WebAPI. Technologies: C# (itext7,itextsharp,,pdfsharp,pdfpig, Tesseract, pdfsharp,,ocropus) SQL Server WebAPI Python (Tabula-py, Camelot, pdfplumber,) Java (Apache pdfbox) Requirements: Visual Studio 2019 or higher SQL Server Management Studio iTextSharp NuGet package NuGet package Step 1: Create a new Visual Studio solution Open Visual Studio and create a new solution Select "ASP.NET Web Application" and choose "Web API" Choose a name and location for your project and click "Create" Step 2: Install iTextSharp and NuGet packages

    £17 / hr (Avg Bid)
    20 bids

    Hi, I have a small problem with my own project. I want to build an application in Java to scan a PDF form. I don't need the whole page, only specific data such as name, phone, ID, dates, etc. I have already built the main structure of the code; my problem is the part where I scan just the data I want. I tried to use stripTextByArea and StrpTextByRegion, but I don't think I used these classes properly, so I am looking for someone who can guide me in the right direction.

    £5 / hr (Avg Bid)
    3 bids

    I am looking for a programmer to develop a PDF editor for a Notary Public website. I need someone familiar with a PDF tool like Apache PDFBox or something like it. The program will be integrated into a notary platform after it is proven to work stand-alone. (I’m not tied to PDFBox. If you are familiar with something else, let me know. I am not a programmer, so I am open to being educated.) Most of the features I need should already be in the PDF developer toolbox. These are the features that I require: Editing feature will allow for applying a signature, adding a notary stamp, adding text and checkmarks. If possible, I would like a “white-out” feature so I can overwrite areas of a document. The software will be used in a WEBEX or ZOOM meeting s...

    £967 (Avg Bid)
    17 bids

    Using Apache PDFBox version 2.0.21 or newer, lines and text can be retrieved rather easily. There are numerous simple examples available on the web. If we extend the PDFGraphicsStreamEngine, or PDFStreamEngine, the actions are in fact only strokePath() and fillPath(int windingRule) -methods. We need to collect paths into a list of, let us call it PathContainer, and draw them later into a canvas (simply an extended JComponent). Our example of fillPath is as follows in the attached file. etc. Now, the path containers have all necessary information about lines, line width, color, tiling pattern, shading, and whatever it contains. And later we paint it (in JComponent) for (int i = 0; i < (); i++) { Object o = (i);

    £465 (Avg Bid)
    22 bids

    ...js or Angular.js With Vue: With Angular: With React: Typescript: Below there is a tutorial on how to create an OCR microservice with Tesseract, PDFBox and Docker Better solutions are welcome! Attention: This is a project that will require future changes and updates. It is not a one-time job, but an investment in a product that will be resold to many (hopefully) customers and which will therefore require (paid) intervention by a developer for the initial configuration. Who

    £1877 (Avg Bid)
    30 bids

    Create an API endpoint that generates a PDF in Spring Boot using the pdfbox dependency <groupId></groupId> <artifactId>pdfbox</artifactId> attaching the PDF formats

    £116 (Avg Bid)
    5 bids

    ...js or Angular.js With Vue: With Angular: With React: Typescript: Below there is a tutorial on how to create an OCR microservice with Tesseract, PDFBox and Docker Better solutions are welcome! Attention: This is a project that will require future changes and updates. It is not a one-time job, but an investment in a product that will be resold to many (hopefully) customers and which will therefore require (paid) intervention by a developer for the initial configuration. Who

    £445 (Avg Bid)
    22 bids

    We are after a Java developer who can assist us in extracting information from PDF CAD Files including lines/shapes, points and text. We would be interested in developers who have had experience using any PDF libraries (PDFBox, ODA, Aspose, iText etc) to extract information from PDF files. We require the PDF to be parsed into their layers & coordinates so we can conduct analysis on the information in the PDF file. In your application please list your PDF experience and any sample projects you have worked on that meet the above skills set or you won't be considered further.

    £18 / hr (Avg Bid)
    15 bids

    Using Apache PDFBox version 2.0.21 or newer, lines and text can be retrieved rather easily. There are numerous simple examples available on the web. If we extend the PDFGraphicsStreamEngine, or PDFStreamEngine, the actions are in fact only strokePath() and fillPath(int windingRule) -methods. We need to collect paths into a list of, let us call it PathContainer, and draw them later into a canvas (simply an extended JComponent). Our example of fillPath is as follows in the attached file. etc. Now, the path containers have all necessary information about lines, line width, color, tiling pattern, shading, and whatever it contains. And later we paint it (in JComponent) for (int i = 0; i < (); i++) { Object o = (i);

    £128 (Avg Bid)
    7 bids

    ...orientation. Develop from Scratch or Modify my existing Android Printing App. I currently have an Android Print service app. (Android studio /kotlin) based on Android Opensource androidcupsprint. Most importantly, I need it to work and will test it by printing on a mobile receipt thermal printer. Required Platform: Android studio I currently have an Android Printing app based on open source PDFBOX and androidcupsprint. The issue is, when I print on thermal or small paper, it always prints preset length. It ignores the orientation and Paper length selected in the Preview. Only thing it obeys in the preview is the number of copies to print. I need it modified such that the print would follow the paper orientation selected and also paper page size/page length selected during ...

    £846 (Avg Bid)
    6 bids

    Project Requirements: Develop an api to create a table in pdf using pdfBox This api should work for tables, nested tables, table border/layout and styling, fonts,cell, cell styling, image, imagecell, text, text styling, paragraphs. Note: should not use any 3rd party APIs (vandeseer/easytable) and/or iText to achieve above. technical configurations: Eclipse Version: Oxygen.3a Release (4.7.3a) Spring framework: Version 3.2.18, spring boot. Java Version : Java SE 1.8 Maven Version: 3.5.4

    £14 (Avg Bid)
    6 bids

    Project Requirements: Develop an api to create a table in pdf using pdfBox This api should work for tables, nested tables, table border/layout and styling, fonts,cell, cell styling, image, imagecell, text, text styling, paragraphs. Note: should not use any 3rd party API's and iText to achieve above. technical configurations: Eclipse Version: Oxygen.3a Release (4.7.3a) Spring framework: Version 3.2.18, spring boot. Java Version : Java SE 1.8 Maven Version: 3.5.4

    £20 (Avg Bid)
    12 bids

    1) Create a pdf parser which we will read data from the pdf and store in the model attached. 2) The pdf contains question text (optional images, equations, formulas), choices, correct answer, complexity etc. Sample PDF has been attached. 3) The question may contain superscript and subscript elements. These should be preserved while parsing. 4) New question m...the machine and its path should be embedded in the question text. Some possible examples: - Q1 What is the name of the below painting? <img src='path_to_image'> Hint:painted by Da Vinci - Ques2 (a+b)<sup>2</sup>=? In example 1, path_to_image is the path on machine where image is stored from the pdf. In example 2, sup means superscript to infer the formula (a+b) whole squared. Preferably using PDFBox...

    £77 (Avg Bid)
    14 bids

    1) Create a pdf parser which we will read data from the pdf and store in the model attached. 2) The pdf contains question text (optional images, equations, formulas), choices, correct answer, complexity etc. Sample PDF has been attached. 3) The question may contain superscript and subscript elements. These should be pr...the machine and its path should be embedded in the question text. Some possible examples: - Q1 What is the name of the below painting? <img src='path_to_image'> Hint:painted by Da Vinci - Ques2 (a+b)<sup>2</sup>=? In example 1, path_to_image is the path on machine where image is stored from the pdf. In example 2, sup means superscript to infer the formula (a+b) whole squared. Preferably using PDFBox or any other open source library an...

    £18 (Avg Bid)
    6 bids

    Looking for a Java developer experienced in Apache PDFBox. I am planning to develop a PDF document annotation project. Urgent, so I can't give you much time.

    £128 (Avg Bid)
    10 bids

    Custom development to enable annotation on PDF. The annotations include: 1) Text Annotation (Font Size, Background Color, Text Color, Strikeout, Pencil etc) 2) Shape Annotation (Rectangular, Circle, custom freestyle, etc) 3) Comments and links Technologies: The annotations can be implemented using the following technologies. Method 1: Using Apache PDFBox and iText, accessible from GWT on Java platforms, with the client-side implementation of events in JavaScript, and the annotations recorded in a SQL database. Method 2: Using the and creating the JavaScript annotations using JavaScript libraries. This is expected to be compiled as a WebJar, and the whole API is expected to be accessible from a Vaadin Component.

    £166 (Avg Bid)
    8 bids

    In this simple project, I need one major function and another one for AD SSO in JSP. 1. AD SSO: the web page has two login methods: A. ID/password, B. without ID/password, just get the AD authentication ID and skip the login to go to the next web page. Here, I want to know how to set up a Tomcat/JSP project to do this, with a step-by-step setup guide... 2. PDF conversion using an open source library (MUST be MIT/Apache licensed) like Apache PDFBox. The purpose is to take an input PDF file and apply options such as color to grayscale, portrait to landscape, split pages from n to n+n, duplex, and copies. The PDF version must also be changed to 1.7, whether the input is a lower or higher version. Please see the detail information in the...

    £143 (Avg Bid)
    5 bids

    We have existing Java J2SE code written a few years ago with an older version of the iText library for converting CSV / database to PDF. Because the new version of iText is no longer free, we would like you to add the choice of using PDFBox / FOP or any other library that you recommend which is free for commercial purposes. This code was written a few years ago, so it needs to be updated to the current JDK. Here is a little discussion: At the time of bidding provide your experience with Java and related technologies. Upon selection, I will provide access to SVN. Please see the attached: PcdConverter-Source-Code Req: Good notes and documentation - ReadMe. With your

    £80 - £398
    Featured Sealed
    17 bids

    I need to test a few different Java PDF libraries (pdfbox, jpdfoptimizer, pdf-tools). I need to flatten the PDF (remove layers, form fields, annotations), essentially doing what printing to the Microsoft PDF printer would do. Need help immediately. Developer needs to be able to quickly load these libraries and send the test PDFs over to me after optimizing using the API.

    £135 (Avg Bid)
    5 bids

    A standalone Java application possibly using AWS Lambda, the code in Java should be as follows: The following file types should be converted to PDF doc type first and then a thumbnail image should be generated. Resolution Requirements for Thumbnail Image PDF - 600*800 with 300 dpi (2 pages) JPEG - 200*200 (1 page) Eml - 600*800 with 300 dpi (2 pages) Crtext - 600*800 with 300 dpi (2 pages) Doc, Docx - 600*800 with 300 dpi (2 pages) PNG - 200*200 (1 page) All conversion of any file type to PDF and generating thumbnails should be done with Apache pdfbox. Appropriate test classes for conversion of file formats and generation of thumbnails should ...

    £22 (Avg Bid)
    Urgent
    2 bids

    Hello! We are trying to implement a PAdES PDF with Java. We are using PDFBox and DSS tools. We have the signature and OCSP response at hand and are looking for someone who can guide us through the process step by step.

    £32 / hr (Avg Bid)
    12 bids

    -PDFBox java library -PDFBox Arabic encoding -PDFBox template form reading and filling fields with Arabic letters correctly -Testing will be by retrieving data from database and fill it in the PDF template and create a new pdf file

    £20 (Avg Bid)
    6 bids

    I am a Java programmer. However, I don't have time to figure out small things like setting up new libraries to be used with my Netbeans environment. I urgently need to set up PDFBox as a library in Netbeans (macOS). I need someone to hand-hold me through setting this up on my machine. I might allow you to take over my Mac using a tool like TeamViewer.

    £22 (Avg Bid)
    7 bids

    ...their desired file type (PDF, IMAGE) To simplify the logic I would recommend the process as follows: 1. The file list is always first converted to PDF and merged into 1 PDF document. 2. If the user selects an export type of image, then use Java to convert the fully merged PDF document to an image. The PDFBox library will merge multiple PDFs into 1 PDF. The PDFBox library will also convert PDFs to an image file, as well as images to a PDF. Code has already been made to convert DOCX to PDF. By using PDFBox you can see that once all files have been converted into one PDF, the PDF can then be further manipulated into the desired file format. If you believe there is a better way to complete this objective I am happy to hear your suggestion. The source code will b...

    £73 (Avg Bid)
    6 bids

    Javascript Developer Role- SF, CA The role: (JavaScript, AngularJS, , , and/or Apache PDFBox™ Java) * BS/MS in Computer Science or Computer Engineering * Minimum of 3+ years of full time development experience with JavaScript and recently 2 years+ AngularJS, , , and/or Apache PDFBox™ Java  * Minimum of 3+ years of experience in core Java development building enterprise applications or software products * Experience as a core member of a software development team with significant programming experience. *We are specifically after freelancers from the US. Our Company We are a boutique enterprise software products firm with a growing client base in the Utilities sector. Presently, we seek an Angular.JS/Fabric.JS software engineer for a long term

    £42 / hr (Avg Bid)
    24 bids

    ...freely available library OpenCV I'm picturing the algorithm being something like the pseudocode below: import org.apache.pdfbox.*; import org.opencv.core.*; /** * @param pdfFilename path to a PDF file that contains scanned page images. * These page images should be easily retrievable with a call to Apache PDFBox * * @param jsonFilename path to a JSON file containing a list of *known* objects * on the page -- it's a simple array of {page, x, y, width, height} objects. These * objects should be filtered out from the object detection results **/ void process(String pdfFilename, String jsonFilename) { pdf = openAndRead(pdfFilename); json = openAndRead(jsonFilename); for each page

    £231 (Avg Bid)
    5 bids

    I require a Java command-line program that automatically extracts information (content/sentences) between bookmarks in a PDF. The program should use either Apache PDFBox or Apache Tika. The program should do the following: a) Java -jar extractContent PDFName Bookmark Extract the content between bookmarks (i.e. print to screen). In the command line a Bookmark name will be provided (e.g. Background) and the program should extract the text between that Bookmark and the next Bookmark. Note: Bookmarks may have several levels (so you need to extract the data between Bookmarks on the same level). b) Java -jar extractContent PDFName Bookmark Keyword If a keyword is provided, then the program should extract the paragraph (located in that Bookmark section) that contains that k...

    £77 (Avg Bid)
    6 bids

    I require a Java command-line program that replaces the characters under rectangles in a PDF with the character “x” (lowercase letter). The program should use Apache PDFBox and/or Apache Tika. I have created a test data set of 10 PDFs. The test dataset can be found at: example: java -jar ./ Deliverables include the following: 1. Source code with documentation 2. Jar file

    £31 (Avg Bid)
    8 bids

    I require a Java command-line program that automatically extracts either a sentence or a paragraph from the PDF based upon a keyword. The program should use either Apache PDFBox or Apache Tika. I have created a test data set of 10 PDFs. The test dataset can be found at: The program should read in the PDF and then output () to the screen the relevant highlighted text example: Java -jar extractText PDFname keyword Sentence Java -jar extractText PDFname keyword Paragraph Sample Keywords: limitations, ethics, regression Deliverables include the following: 1. Source code with documentation 2. Jar file See:

    £18 (Avg Bid)
    12 bids

    I require a Java command-line program that replaces the characters under rectangles in a PDF with the character “x” (lowercase letter). The program should use Apache PDFBox and/or Apache Tika and be a modification of an existing Java program that I already have (PDFObfuscation) that creates these rectangles in the PDF. Source code will be provided to the successful bidder. I have created a test data set of 10 PDFs. The test dataset can be found at: example: java -jar ./ 2 50 gray 95 Deliverables include the following: 1. Source code with documentation 2. Jar file

    £17 (Avg Bid)
    3 bids

    ...QRcode Create a simple A4 page as PNG in 300dpi and share this file also on your first delivery. Looks pretty easy? Yes it is, we want to see your basic OpenCV skills before we assign you the rest of the task. Mandatory libs - 's javacv-platform lib - 's openpdf lib (if needed) - 's pdfbox lib (if needed) Restricted libs - any Spring lib Payments Our payment plan is fixed to the delivery of these milestones: #1 (40% payment) - provide us a full working implementation as source (see also later: in our git repository, no zip file or binary file deliveries are accepted!) #2 (60% payment) - after our finished tests (can take up to 14 workdays) #3 (20% bonus) - if you deliver

    £72 (Avg Bid)
    3 bids

    I require a Java command-line program that automatically extracts specific highlighted text from PDFs. The program should use either Apache PDFBox or Apache Tika. I have created a test data set of 10 PDFs. I have highlighted sections of PDFs using a different highlight color (and name). These different highlighted sections are named Objective, MethodStats and Limitations. The test dataset can be found at: The program should read in the PDF and then output () to the screen the relevant highlighted text (based upon which Section is asked for in the command line) example: Java -jar extractHighlight PDFname Objective Java -jar extractHighlight PDFname Limitations Deliverables include the following: a) Source code with documentation b)

    £20 (Avg Bid)
    8 bids

    I require a Java command-line program that automatically extracts information (content/sentences) between bookmarks in a PDF. The program should use either Apache PDFBox or Apache Tika. The program should do the following: a) Java -jar extractContent PDFName Bookmark Extract the content between bookmarks (i.e. print to screen). In the command line a Bookmark name will be provided (e.g. Background) and the program should extract the text between that Bookmark and the next Bookmark. Note: Bookmarks may have several levels (so you need to extract the data between Bookmarks on the same level). b) Java -jar extractContent PDFName Bookmark Keyword If a keyword is provided, then the program should extract the paragraph (located in that Bookmark section) that contains that k...

    £38 (Avg Bid)
    5 bids

    ...java-code. For this task it is fine for us to see your work doing the job on already manually exported PNG images. (please share them here on first delivery!) Looks pretty easy? Yes it is, we want to see your basic OpenCV skills before we assign you the rest of the task. Mandatory libs - 's javacv-platform lib - 's openpdf lib (if needed) - 's pdfbox lib (if needed) Restricted libs - any Spring lib Payments Our payment plan is fixed to the delivery of these milestones: #1 (40% payment) - provide us a full working implementation as source (see also later: in our git repository, no zip file or binary file deliveries are accepted!) #2 (60% payment) - after our finished tests (can take up to 14 workdays) #3 (20% bonus) - if you deliver

    £117 (Avg Bid)
    2 bids

    Are you a master of OpenCV who can easily fix issues with wrongly scanned documents? The documents are either slightly rotated (about +/- 0-30°) or totally wrongly oriented (about 180°) Mandatory libs - 's javacv-platform lib - 's openpdf lib - 's pdfbox lib Restricted libs - any Spring lib Payments Our payment plan is fixed to the delivery of these milestones: #1 (20% payment) - provide us a full working implementation as source (see also later: in our git repository, no zip file or binary file deliveries are accepted!) - project gets built with Maven - implementation contains rotation angle detection in degrees (orientation is not mandatory in this step) implement a method: double getRotatedAngle()

    £137 (Avg Bid)
    4 bids

    ...understanding the PDFBox library. Please apply only if you already worked with PDFBox or iText or other PDF software. What we need: Utility/jar/class we can call from our java WebApp which is running on Linux server (this may affect non-java solutions) under Tomcat with Java 8. Problem: we need to extract text from searchable PDF (not scanned) and preserve text positions - so ideally lib should return words/tokens with x/y start/end positions as well as start/end coordinates of vertical and horizontal line separators. We need to get only the text a user can see; or if we get full text, we need a clear understanding what part of text is visible to the end-user and what part of text is not-visible. Attached is an example of a pdf file that has hidden text. We tried...

    £411 (Avg Bid)
    6 bids

    Hopefully this will be a quick and easy project. I just need some Java code that I can pass a folder path (e.g., String folder = "C:SomePDFs"), and have the program iterate through each PDF file and extract the full text into a string, so I can do some other stuff with it inside the loop. (If it matters, I use Eclipse as my IDE.) I looked into Apache's pdfbox tool, but I'm not a Java guy and couldn't figure it out in time. You can use pdfbox or any PDF parser you like. The only other requirements are that it has to be able to deal with non-English characters (e.g. Japanese, Chinese, Arabic, etc.), and if you use some kind of 3rd party library, I will also need you to tell me what libraries to include and where to get them. If you have questions, ...

    £26 (Avg Bid)
    10 bids

    Only developers who have prior experience with PDFBox should apply.

    £45 (Avg Bid)
    4 bids

    Using PDFBox () to convert a json file, utilising an available 3rd party SDK extract data and create the new PDF document. The PDF will be a stylised Invoice design and having multiple pages. The design will be based on multiple examples that will be supplied. Code must include a PDFService class that sets the style and json object and a method to return the PDF binary.

    £73 (Avg Bid)
    22 bids