Browse our Products
<dependency>
<groupId>com.aspose</groupId>
<artifactId>aspose-pdf</artifactId>
<version>24.10</version>
</dependency>
compile(group: 'com.aspose', name: 'aspose-pdf', version: '24.10')
<dependency org="com.aspose" name="aspose-pdf" rev="24.10">
<artifact name="aspose-pdf" ext="jar"/>
</dependency>
libraryDependencies += "com.aspose" % "aspose-pdf" % "24.10"
Java API to Process & Manipulate PDF Files
Product Page | Docs | Demos | API Reference | Examples | Blog | Search | Free Support | Temporary License
Aspose.PDF for Java is a PDF document creation component that enables your Java applications to read, write and manipulate PDF documents without using Adobe Acrobat.
aspose.pdf
Package Features
PDF Document Featres
- Set basic information (e.g. author, creator) of the PDF document.
- Configure PDF Page properties (e.g. width, height, cropbox, bleedbox etc.).
- Set page numbering, bookmark level, page sizes etc.
- Apply document open action, open mode as well as appearance.
- Document can have different page transition effects such as dissolve or box.
- Create PDF documents via
XML
,API
orXML
andAPI
combined. - Ability to work with text, paragraphs, headings, hyperlinks, graphs, attachments etc.
Security Features
- PDF documents can be encrypted up to 128 bits.
- Master and user passwords can be set for PDF encryption.
- Apply rescriptions on content modification, copying, printing and other operations.
Conversion Features
- Convert an existing XML file (
.XML
) orXmlDocument
to a new PDF document or a PDF file stream. - Convert conventional Image formats into PDF file.
- Convert
PCL
files into PDF file.
For a more comprehensive list of features, please visit Features of aspose.pdf
Package.
com.aspose.pdf
Package Features
- Supports 14 core fonts.
- Support for
Type 1
,TrueType
,Type 3
,CJK
fonts. Unicode
support is available.- Add, search, extract and replace text in PDF files.
- Add/delete, extract and replace images.
- Insert, delete, split PDF pages.
- Support for Linearization (optimization for the web).
- Set and get XMP metadata.
- Validate (
PDF/A-1a
,PDF/A-1b
). - Work with bookmarks, annotations, PDF forms, stamps, watermarks and more.
For a more comprehensive list of features, please visit Features of com.aspose.pdf
Package.
com.aspose.pdf.facades
Package Features
- Supports 14 core fonts.
- Support for
Type 1
,TrueType
,Type 3
,CJK
fonts. Unicode
support is available.- Add, replace and extract text & images (from the entire PDF, a particular page, or a range of pages).
- Work with bookmarks, annotations, PDF forms, links, actions, signature and more.
- Print PDF to default, specified, physical, or virtual printer.
- Print PDF to
XPS
file or XPS printer.
For a more comprehensive list of features, please visit Features of com.aspose.pdf.facades
Package.
Read & Write PDF & Other Formats
Fixed Layout: PDF, XPS
Books: EPUB
Web: HTML
Other: TEX, XML, SVG
Save PDF Documents As
Microsoft Office: DOC, DOCX, XLS, XLSX, PPTX
Images: JPEG, PNG, BMP, TIFF, EMF
Other: MobiXML, XML, TEXT
Read Formats
CGM, MHT, PCL, PS, XSLFO, MD
Supported Environments
- Microsoft Windows: Windows Desktop & Server (x86, x64)
- macOS: Mac OS X
- Linux: Ubuntu, OpenSUSE, CentOS, and others
- Java Versions:
J2SE 8.0 (1.8)
or above
Get Started
Aspose.PDF Java APIs are hosted at the Aspose Repository. You can easily use Aspose.PDF for Java API directly in your Maven projects with simple configurations. For the detailed instructions please visit Installing Aspose.PDF for Java from Aspose Repository documentation page.
Extract text from a PDF file using Java
// For complete examples and data files, please go to https://github.com/aspose-pdf/Aspose.Pdf-for-Java
// Open document
Document pdfDocument = new Document("input.pdf");
// Create TextAbsorber object to extract text
TextAbsorber textAbsorber = new TextAbsorber();
// Accept the absorber for all the pages
pdfDocument.getPages().accept(textAbsorber);
// Get the extracted text
String extractedText = textAbsorber.getText();
// Create a writer and open the file
java.io.FileWriter writer = new java.io.FileWriter(new java.io.File("Extracted_text.txt"));
writer.write(extractedText);
// Write a line of text to the file tw.WriteLine(extractedText);
// Close the stream
writer.close();
Product Page | Docs | Demos | API Reference | Examples | Blog | Search | Free Support | Temporary License
Version | Release Date |
---|---|
24.10 | October 30, 2024 |
24.9 | September 27, 2024 |
24.8 | August 29, 2024 |
24.7 | July 31, 2024 |
24.6 | June 28, 2024 |
24.5 | May 30, 2024 |
24.4 | April 29, 2024 |
24.3 | March 29, 2024 |
24.2 | February 28, 2024 |
24.1 | January 31, 2024 |
23.12 | December 28, 2023 |
23.11 | November 29, 2023 |
23.10 | October 31, 2023 |
23.9 | September 28, 2023 |
23.8 | August 29, 2023 |
23.7 | July 27, 2023 |
23.6 | June 29, 2023 |
23.5 | May 30, 2023 |
23.4 | April 28, 2023 |
23.3 | March 24, 2023 |
23.2 | February 28, 2023 |
23.1 | January 26, 2023 |
22.12 | December 23, 2022 |
22.11 | November 30, 2022 |
22.10.1 | November 17, 2022 |
22.10 | October 31, 2022 |
22.9 | September 23, 2022 |
22.8 | August 24, 2022 |
22.7.1 | August 5, 2022 |
22.7 | August 5, 2022 |
22.6 | June 30, 2022 |
22.5 | June 1, 2022 |
22.4 | April 28, 2022 |
22.3 | March 25, 2022 |
22.2 | February 22, 2022 |
22.1 | January 21, 2022 |
21.12 | December 20, 2021 |
21.11 | November 22, 2021 |
21.10 | October 20, 2021 |
21.9 | September 20, 2021 |
21.8 | August 20, 2021 |
21.7 | July 8, 2021 |
21.6 | June 17, 2021 |
21.5 | May 20, 2021 |
21.4 | April 20, 2021 |
21.3 | March 19, 2021 |
21.2.1 | March 10, 2021 |
21.2 | February 20, 2021 |
21.1 | January 22, 2021 |
20.12 | December 18, 2020 |
20.11.1 | December 1, 2020 |
20.11 | November 18, 2020 |
20.10 | October 20, 2020 |
20.9 | September 17, 2020 |
20.8.1 | September 14, 2020 |
20.8 | August 19, 2020 |
20.7 | July 20, 2020 |
20.6 | June 19, 2020 |
20.5.1 | May 28, 2020 |
20.5 | May 19, 2020 |
20.4 | April 17, 2020 |
20.3 | March 19, 2020 |
20.2 | February 20, 2020 |
20.1 | January 22, 2020 |
19.12 | December 19, 2019 |
19.11 | November 29, 2019 |
19.10 | October 31, 2019 |
19.9 | September 30, 2019 |
19.8 | August 29, 2019 |
19.7 | July 31, 2019 |
19.6 | June 26, 2019 |
19.5 | May 31, 2019 |
19.4 | April 30, 2019 |
19.3 | March 29, 2019 |
19.2 | February 28, 2019 |
19.1 | January 29, 2019 |
18.12 | December 30, 2018 |
18.11 | November 29, 2018 |
18.10.2 | November 12, 2018 |
18.10 | October 30, 2018 |
18.9.1 | October 4, 2018 |
18.9 | September 28, 2018 |
18.8 | August 29, 2018 |
18.7 | July 31, 2018 |
18.6 | June 29, 2018 |
18.5 | May 31, 2018 |
18.4 | April 25, 2018 |
18.3 | March 30, 2018 |
18.2 | February 28, 2018 |
18.1 | February 6, 2018 |
17.12 | December 31, 2017 |
17.11 | November 30, 2017 |
17.10 | November 1, 2017 |
17.9 | October 2, 2017 |
17.8 | August 31, 2017 |
17.7 | July 20, 2017 |
17.6 | June 29, 2017 |
17.5 | June 7, 2017 |
17.4 | April 28, 2017 |
17.3.0 | April 6, 2017 |
17.2.0 | February 23, 2017 |
17.1.0 | January 26, 2017 |
16.12.0 | December 27, 2016 |
16.11.0 | November 29, 2016 |
16.10.0 | October 26, 2016 |
12.0.0 | October 10, 2016 |
11.9.0 | September 20, 2016 |
11.8.0 | August 11, 2016 |
11.7.0 | July 12, 2016 |
11.6.0 | June 10, 2016 |
11.5.0 | May 10, 2016 |
11.4.0 | April 8, 2016 |
11.3.0 | March 7, 2016 |
11.2.0 | February 8, 2016 |
11.1.0 | December 30, 2015 |
Aspose Total Aspose PDF Aspose Java API pdf java library pdf java class Maven PDF CGM EPUB HTML TeX MHT PCL PS Postscript SVG XML XPS XSLFO MD Markdown XLS XLSX PPTX DOC DOCX MobiXML JPEG EMF PNG BMP GIF TIFF Text document compression table creation graph font bookmark data source attachment annotation export print font image form stamp watermark link security signature TrueType CJK Unicode CMYK PDFA XML FDF XFDF stamp watermark Windows Linux Mac J2SE extract text text extraction PDF to DOC PDF to DOCX PDF to XLS PDF to XLSX PDF to PPTX HTML to PDF MHTML to PDF PDF to HTML BMP to PDF CGM to PDF DICOM to PDF EMF to PDF JPG to PDF PNG to PDF SVG to PDF TIFF to PDF PDF to TIFF PDF to SVG EPUB to PDF Markdown to PDF MD to PDF PCL to PDF TXT to PDF XPS to PDF PS to PDF Postscript to PDF XML to PDF XSLFO to PDF LaTeX to PDF TeX to PDF PDF to EPUB PDF to LaTeX PDF to TeX PDF to TXT PDF to XPS PDF to PDF A_1b PDF to PDF A_3b PDF to PDF A_3a PDF to PDF A_2a PDF to PDF A_3U PDFA to PDF