New GemBox.Document 2.5 Release with PDF read support

We have just released new setup, help, and examples for GemBox.Document 2.5, which adds support for reading PDF files.

GemBox.Document currently supports reading PDF files that contain text in paragraphs and/or tables. It tries to recognize the logical structure of the document from the content of the PDF pages. The output is not high fidelity, and its quality depends on how complex the page content is, but this approach has the following advantages:

  • The logical structure of the document (sections, paragraphs, tables) is available.
  • Text search is fully supported.
  • Editing a document is fully supported.

Note that we plan to progressively improve recognition of the logical structure and to add support for new features, such as pictures and form fields.
For more information about PDF reading with GemBox.Document, see our help page: Support level for reading PDF format.
For an example that reads a PDF file and extracts its text, see the Read and Extract PDF Text in C# and VB.NET example.
Other notable improvements included in this release are:

We hope that you like the new features. If you have any questions, complaints, or suggestions, feel free to contact us or send us feedback.