Optimizing images in PDF files free with PrimoPDF

Optimizing PDF filesWhile PrimoPDF makes it easy to prepare PDFs for the most common usage scenarios with the Screen, Print, eBook, and Prepress output options, it also enables you to fine-tune image settings using Downsampling.

Downsampling enables you to reduce image resolution, discarding pixel information and resulting in a smaller PDF.  Simply choose Custom from the PDF output options in the PrimoPDF window to customize settings for Color, Grayscale, and Monochrome images. Once you select an image type, you can adjust the following settings:

  • Don’t downsample. Produces the highest quality of output, best for creating PDF files for printing to a commercial quality printer.
  • Average. Produces good quality results suitable for printing to most desktop printers.
  • Subsample. Results in smaller file sizes and faster PDF creation, but may decrease visual quality.
  • Bicubic. Creates higher visual quality than other subsampling methods, but may take longer to generate.

You will also need to specify a downsample threshold, which establishes the resolution threshold in PPI (pixels per inch) for downsampling and the target resolution. Any images having a resolution above the first value specified will be downsampled to the second resolution specified. Now you can prepare your PDF images for any situation!

addthis-custombutton Optimizing images in PDF files free with PrimoPDF

Picture-perfect PDF files with Nitro PDF Professional

By now, you’re well aware of the benefits of using PDFs to share information with others. PDF provides you the means to create pixel-perfect files that look just like the originals, while the popularity of PDF viewers make it an ideal format for sharing files with others.

One clever use of PDF is in packaging up images. Distributing a batch of images as individual files can be a real pain for recipients, even more so in cases where each image has to be opened separately for viewing. What’s more, depending on the quality of your images, individual image file sizes can creep up into several mega-bytes, which can really limit your audience, especially if you’re sharing over the web or via email. Enter PDF — using Nitro Pro 6, you can easily combine all of your images into a single PDF “album”, using tools to make quick touch-ups including reordering, cropping, editing, optimization and more. Whether you’re distributing high-quality graphics to business associates or creating digital photo albums to share with family and friends, Nitro Pro includes tools that you can use to make your image-based PDFs really shine.

Creating a PDF Album

Nitro PDF Professional provides a handy way to convert and combine several images into a single PDF “album”, all in a single task.

To Create A PDF Album

  1. From Windows Explorer, select your files to convert and combine (use CTRL+click to select multiple files at once).
  2. Right-click and choose Combine supported files in Nitro PDF….
  3. Combine PDF

  4. In the Combine Files dialog, arrange the images in the order you’d like them to appear.
  5. Combine PDF files dialog

  6. Choose your output settings and click Create.
  7. When prompted, name the PDF file.

If you’re satisfied with your album, you can simply distribute it to your intended recipients, or you can use Nitro Pro’s handy editing tools to polish it up a bit further.

Editing Pages and Images in your Album

With the right set of PDF editing tools, you can easily perform simple tweaks to make any of your PDF documents (including PDF albums) picture-perfect. Here are a few examples:

  • Rearranging PDF pages. If you don’t like the order in which your pages appear, you can use the Pages pane to quickly select PDF pages then drag them into the desired order. (Tip: To display the Pages pane, from the navigation tabs along the left side of the Nitro Pro window, click the Pages tab.)
  • Cropping pages. Use the page cropping tool to crop pages to an exact size, or to remove unnecessary or unwanted information. This can be particularly handy with images, if you’d like to highlight a focal point in your photo.  The cropping tool is located on the Insert and Edit tab, under the Images group. Simply click and drag to create a rectangle around the area you’d like to keep, and double-click the cropping area. In the Crop Pages dialog, you can adjust the crop margins and specify additional pages to crop, if desired.

Crop PDF page

  • Adding Captions. Use the text editing tool to easily add captions to your images. From the Insert and Edit tab, under the Text group, choose Insert. You can then type in your desired text in any location on the page, choosing font face, size, and color settings to add a personal touch to your images.
  • Editing Images. Nitro Pro’s image editing tools make last-minute touch-ups a snap. Double-click any image in your PDF to open the Format tab which is home to all image editing tools, including options to delete, insert, replace, crop, extract, scale, and arrange images, as well as tools for more advanced tasks such as adjusting brightness, resolution, and color space.
  • Shrinking Albums for Distribution. Once you’re satisfied with your album, you can further optimize it to make the file size smaller, and thus friendlier, for your recipients. The Optimize Document settings are available from the Nitro PDF Button, under Prepare, and include preset image optimization settings, along with a Custom option that allows you to specify your own.

These are just a few of the ways you can use Nitro PDF Professional in creating polished PDF packages of your photos and images. (Tip: You can also use our Extract Images feature to quickly export all of the images from your PDF as individual files, for reuse in other programs.) With Nitro Pro 6.0, we’ve made the process even easier - if you haven’t already done so, we’d encourage you to download and give it a whirl!

addthis-custombutton Picture-perfect PDF files with Nitro PDF Professional

Introducing Nitro PDF Professional 6

Nitro PDF Professional 6If you haven’t already heard, we’ve just announced the launch of Nitro PDF Professional 6 and today are featured on the Download.com homepage.

The release comes from almost a year of long days and nights by our product management and development teams. A special thanks goes to them and to those of you who shared feedback on Nitro Pro, told us what you wanted, and participated in the beta program. Thank you all!

So what’s the difference between version 5 and 6? In short, a lot. Major new features, enhancements of existing features, bug fixes, and ‘under the hood’ work that will make working with PDF files in Nitro Pro even faster and easier.

This blog post quickly runs you through our latest version, but it doesn’t really do it justice. The best way to learn about Nitro Pro 6 is to grab the 14-day trial and see for yourself.

If you want to learn more, read the full feature list, get free 14-day trial, or update your version 5 to 6, follow the appropriate link.

All-new conversion (exporting) engine

Our all-new, accurate conversion engine streamlines exporting content from PDF files for reuse in other applications. Designed in particular to work best with PDF files that originally came from Microsoft Word, our PDF-to-Word converter produces files that reproduce the look-and-feel of the original Microsoft Word and PDF versions, while being easy to edit. Features include:

  • Converting content based on how you’re going to reuse it. Convert PDF files to DOC or RTF. Choose from conversion modes designed to create files that precisely reproduce the look of the originals, or ones more suited to major editing and restructuring — all depending on how you intend to use them.
  • Accurate paragraph reproduction. Generate files with well-formatted, correctly-justified, and easy-to-edit paragraphs, eliminating the need for manual reformatting or other tweaking. (Not as easy as it might sound!)
  • Convert tables. Convert tabular PDF content into editable tables, including complex and irregular tables.
  • Scalable charts, graphs, diagrams, and vector images. Convert vector-based images into scalable objects in Microsoft Word (wherever possible).
  • Detect header- and footer-based content and turn them into active headers and footers, or remove that content from your output.
  • Extract text and all images, while excluding all the formatting and layout information.

Better, faster PDF creation

The creation engine is all new too. Designed with performance in mind, the engine creates PDF files much faster on average than version 5, while offering more control over your output.

  • 50% faster creation than Nitro Pro 5 (on average).
  • Create PDF/A compliant PDF files (PDF/A-1b) and ensure your documents will be fully accessible in the years to come.
  • Greater creation control with an expanded range of conversion options, you can produce more consistent PDF files. Includes output settings for PDF/A-1b compliance, document information, security, open options, font embedding, and image compression, along with the ability to store settings for quick reuse.

Commenting List and Summarize Comments

The new Comments List panel adds support for multi-line comments, letting you view each piece of text feedback in its entirety and making working with longer comments much easier.

PDF Comments

The new Summarize Comments feature further simplifies the process of reviewing feedback, enabling you to review page content and comments side-by-side — both onscreen and on paper.

Summarize and print PDF comments

Document management system (DMS) support

Added WebDAV support means you can now use Nitro Pro with most popular document management systems, including Hummingbird, Documentum, and SharePoint.

Easy-to-use Navigation pane

The Pages, Bookmarks and Signatures panels are now much easier to display and hide as they’re accessible directly from the left edge of the Nitro Pro window.

PDF navigation

Better text editing

The new Text Editing context ribbon places all text editing tools in the one place and displays the moment you start editing text. In addition, Nitro Pro 6 handles fonts better, letting you embed and subset embed fonts as you edit. Editing text in PDF files

Improved image optimization

The Optimize Documents tool now gives you far greater control over how your images are shrunk.

Optimize and shrink PDF files

Grid and Snap to Grid

New grid and snap-to-grid capabilities simplify the process of designing and laying form fields, links and other objects.

Snap to Grid in PDF

Transparent image support

Stamps and digital signatures now support transparent images, letting you use a greater range of images and have them looking much better in your PDF files.

Zoom with your mouse

New Zoom tool feature makes zooming in and out easier. When you have the Zoom tool selected, just hold down your Control button and use your mouse scroll wheel to move in and out.

More intuitive Pencil annotation tool

In Nitro Pro 5, adding several Pencil annotations in a row (something I love to do) was difficult as you had to switch out of the tool and back to ensure your pop-up notes were kept separate for each Pencil annotation. Nitro Pro 6 works more intuitively and means you never need to switch in and out of tools — after a short pause of inaction Nitro Pro 6 assumes any subsequent annotation should be treated as a new annotation and should have its own pop-up note.

Getting Started dialog

A new-and-improved Getting Started dialog for learning about Nitro Pro.

Getting Started with Nitro PDF Professional 6

Support & troubleshooting tools

To make it easier for you to repair Nitro Pro when changes are made on Windows, use Nitro Pro’s in-built tools to fix common problems that can occur with print drivers, Microsoft Office add-ins, and more. In addition, quickly and automatically collect all relevant system information so our support team can help you faster.

PDF support

Optimized PDF engine

Nitro Pro’s core PDF library has been completely optimized, resulting in smaller file sizes, faster open and save times, and much more.

As I said, this post is just a taste — you really need to get the 14-day free trial to see the difference.

We’re now heading out to celebrate!

addthis-custombutton Introducing Nitro PDF Professional 6

The first free PDF-to-Excel converter is live

PDF to ExcelToday I’m excited to let you be the first to know that, hot on the heels of our PDF to Word release, we have launched a brand-new PDF to Excel service. We believe it’s the first free tool for converting PDF to the XLS format.

PDF to Excel takes care of the normally messy, tedious and manual job of getting tabular content out of PDF files for reuse as editable spreadsheets and tables. The files we create work in Microsoft Excel, OpenOffice, Google Docs, Corel WordPerfect and whatever other applications that open XLS files.

Here’s how it works:

  1. Head to the http://www.pdftoexcelonline.com site
  2. Select a PDF to convert, enter your email address, and submit the file
  3. Our server processes the file (identifying tables, discarding other content and packaging the tables into a XLS file) and then emails it
  4. You check your email in a minute or two
  5. All done.

There’s a tendency online to equate ‘free’ with ‘cheap and nasty,’ but that’s not what Nitro is about with our free PDF tools. Like our PDF-to-Word service, we think our PDF-to-Excel converter stacks up extremely well against the tools that you must pay for. The key areas where we think we have raised the bar are:

  • Fully-automated processing. Most existing tools are unable to automatically scan PDF files, detect and then extract just table content — meaning you have to manually select each table to extract them or alternatively you receive an XLS file containing the entire contents of the PDF, which you must sift through to hopefully find your tables.
  • Retaining the look and feel of tables. Most PDF-to-Excel converters don’t attempt to retain the look of tables at all. We get right into it. The example below shows you the kinds of formatting we can handle, including fonts sizes and styles, borders, cell colors and row and column spacing.

PDF to Excel

A PDF converted to XLS using our service

Over the past month, we’ve had quite a few inquiries about where to go to download our converter? Well, at this stage it is an online service only. Our desktop product — Nitro PDF Professional 6 — is due for release soon, which will include the technology used in our free conversion service, only it’ll have some extra capabilities. Standby for that.

We’d love to hear what you think about our latest free tool, please try out PDF to Excel and share your thoughts here.

addthis-custombutton The first free PDF-to-Excel converter is live

Changing the look of your drawing and markup tools

Cloud tool

Recently Chris Dahl looked at how you can set the default look of your PDF annotations and pop-up notes, but I thought I’d look further into some of the appearance settings you can change for your drawing markups in Nitro Pro. These include the shapes (rectangle, line, arrow, oval, polygon, and so on), as well as the freehand Pencil tool. The exact changes can vary depending on the tool, but here’s a quick summary:

  • Line style. Choose from several different line styles, including different ‘dashed’ lines.
  • Line thickness. Depending on the kinds of PDF files you’re marking up, you might find the need to thicken up or thin out the thickness that’s used by default.
  • Color. Control the border color.
  • Fill color. When coloring the body of a markup is necessary, this lets you set it.
  • Opacity. When you want content beneath your markups to be visible.

I won’t repeat what Chris has already covered, so if you’d like to customize the look of your drawing markups, follow the step he’s already outlined.

addthis-custombutton Changing the look of your drawing and markup tools

Join the Nitro group on LinkedIn

Nitro PDF Software on LinkedInWe recently launched a Nitro PDF group on LinkedIn. Our new group is dedicated to connecting with and helping out people who use our range of PDF-related products and services. If you use Nitro PDF Professional, Nitro PDF Express, PrimoPDF, PDF Hammer, PDF Download, or our PDF-to-Word service, please join up and connect with other users and the Nitro team.

addthis-custombutton Join the Nitro group on LinkedIn

Our PDF-to-Word service now free for everyone to use

Convert PDF Word freeWe’ve just ended the invite-only beta program for our new free PDF-to-Word converter service, so if you missed out and are keen to start using it, head to the site now.

We’ve already talked quite a bit about what our PDF converter does and how much we think our service has raised the bar for accuracy, so today I thought I’d look at the flipside and highlight what our converter doesn’t do.

Our PDF-to-Word technology DOES NOT

  • Do fake tables. We rarely use tabs to force tabulated information into a proper layout. Wherever possible, we re-build the table from scratch to ensure the best layout and editability.
  • Ignore text justification. We don’t left align all text. We use the correct left, right, center, and justify text alignment tools.
  • Phony paragraphs. We don’t use line breaks to force text onto the next line, or text boxes to force blocks of text into position. We keep paragraphs editable and part of the on-page content.
  • Merge text and images. Some PDF-to-Word converters unnecessarily merge images and text during conversion to try to produce better-looking output files. We separate text and images for enhanced editability, while retaining the same layout.
  • Ignore headers/footers. Most PDF-to-Word converters ignore header and footer content and reformat it as a normal line of text on the page. Wherever possible, we detect headers and footers and recreate them using Microsoft Word’s Header and Footer tools.

We hope you enjoy using the service. If you do, please let your friends know about it as well.

addthis-custombutton Our PDF-to-Word service now free for everyone to use

Setting default colors & styles for comments

If you find yourself changing the color of the PDF markup annotations in Nitro Pro repetitively, then please read on. There *is* a way for you to set the default properties for a particular type of markup annotation (this also works for form fields too BTW).

1. Create your comment/markup.

2. Right click, go to Properties and set the appearance.

3. Click OK to close the Properties dialog.

4. Now right click on the comment/markup again and left click on Make Current Properties Default (see screenshot below).

default comment properties

addthis-custombutton Setting default colors & styles for comments

The future for PDF document reviewing with Nitro Pro

Print & summarize comments in PDF filesThe Nitro team uses PDF-related markup and review tools extensively. In the product and support side of things, there are specifications and interfaces to discuss, in marketing, there are web pages, flyers, boxes, logos and more, and at a management level, there’s regularly agreements, contracts and the like to circulate and finalize.

Where I think PDF stands apart from the review tools of Microsoft Word and Excel is the fact that it feels much more like reviewing a paper document. All the tools you’re used to, such as text highlighters, pens, notes and stamps make it intuitive, and most importantly, allow you to annotate any part of a page. This is very different to applications like Microsoft Word, which are limited to just commenting on text content and become confusing when content is inserted, edited and deleted during the reviewing process.

For those of you using Nitro Pro for commenting, you’ll already be familiar with the set of annotation tools (full details on our annotate & highlight PDF page), and you might also be familiar with the fact that working through a large amount of comments can be challenging. What I wanted to do today was give you a sneak peek at what we’re working on to help you work more efficiently.

The first one (see the screenshot below) is summarizing comments. What you can see there are some very basic mock-ups and some of the early comment summary output we’ve been generating. If you’re familiar with adding comments in Microsoft Word, then you’ve probably seen the ‘balloons’ you can display along the right side of the page to see comments and changes. As the interface of Nitro Pro and Office 2007 is so similar, we’ve tried to follow the same principle, so moving between the two applications has the smallest learning curve.

Summarize comments in PDF files

The functionality and design has come from talking to users who like to comment (a lot!). From there the product management and development teams have worked together to establish what’s possible. Simple wireframes have evolved into fully working demonstrations. The support, sales and marketing teams have had demos and shared their thoughts. And as each iteration has passed we’ve refined the functional capabilities and how summary output will look and print.

There are several benefits you’ll get using the Summarize Comments feature. Firstly, you’ll be able to look at all the text feedback on the page in one go, and, with connectors between the text feedback and each markup on the page, easily follow what’s going on. Secondly, if you like to use paper as part of your review process, you can print out your summary and work with good old pens and markers — while we all love to save paper whenever we can, the fact is it’s sometimes much easier to do some of the reviewing on paper, especially when working with large documents and many comments.

We are also working on significantly improving the Comments pane, making it easier to work with large amounts of comments, read and add/edit text in there, and better handle replies. The most noticeable change is the support for multi-line display, which will make it much easier to read longer comments without switching from the pane back out to the note on the page. The screenshot below show a very early demonstration of the new pane in action.

PDF Comments pane

This is really just a taste of what we’re working on, which you’ll be able to get your hands on later in the year with our next major release. If you’re a paid-up owner of Nitro PDF Professional and you’d like to have the chance to try out upcoming private betas, please head to our beta sign up page and enter your details. If you’d like to like to see our new convert PDF to Word technology in action, follow the link and enter nitro as your invite code to try it via our online beta service.

addthis-custombutton The future for PDF document reviewing with Nitro Pro

PDF-to-Word Conversion: Why It’s So Hard to Do

When a file is converted to PDF, it loses its meaning. On the surface all the information is there, and to your eyes it looks exactly the same, but underneath that, all the method, structure and intelligence used when designing the original document has been lost. This forms the heart of the challenge faced when attempting to convert PDF files back to formats like DOC (Microsoft Word), RTF and HTML, and is not dissimilar to those faced when OCRing paper-based documents.

Once you have your PDF file, the original layout and meaning formed from text-based building blocks — including words, lines (and line breaks), paragraphs, columns, tables, headers/footers and outlines — are long gone. Once in a PDF, its content just describes how and where on the page each object should be displayed.

This is a far cry from where you would be if you went back to the original file in Microsoft Word, Open Office, Google Docs, Adobe InDesign, or whatever. These kinds of word processing and desktop publishing applications follow similar principles, and it’s why converting files between them (while certainly not perfect) is a much more simple process.

How files are normally designed and edited in word processing applications

Most word processing applications use the same sort of principles for formatting and giving meaning to content. For the sake of this article, I’ll use Microsoft Word as the example. Here’s a few of the main ones:

  • Paragraphs let you work with text that reflows across lines and can be quickly reformatted using styles to adjust spacing, indent, size and more.
  • Columns let you incorporate more complex page layouts and in many cases make content easier to follow and give meaning to using different grouping styles.
  • Tables let you layout tabular information not suited to the more linear formatting offered by paragraphs and columns.
  • Headers & footers let you repeat content more consistently across multiple pages.

PDF to Word is like the OCR process

If you’re familiar with optical character recognition (OCR) and converting paper to electronic form, you might have already grasped some of the complexities we’re dealing with. Apart from recognizing fonts and how they should be displayed on the page, the challenges are much the same for both as all meaning and structure is gone from the contents.

The loss of the text stream

Take a look at the screenshot below. The first three lines of text show how it is displayed on the page in a PDF. The second shows how many separate objects the text is broken into inside the PDF. For each small text object, the PDF includes co-ordinates that simply describe where it should be positioned on the page and how it should be displayed.

Text objects in PDF

The first challenge for exporting text back out of PDF files comes when the streams of text from the original word processor get broken up into these seemingly random chunks. From here we must start to discern what their relationship is to the content around them. This process begins by sucking out all the text from the PDF.

Rediscovering words one object at a time

To begin with, the PDF-to-Word converter must put the words back together. We can look at each text object, its properties, the distance it is from surrounding text objects, and start to see where words and whitespace between them might exist.

Good conversion starts at the accurate detection of line ends

The key to recreating an editable Word file is accurately detecting where each line ends. If you take a look at the example below, it’s pretty easy to see where each of the lines end, new paragraphs start, and columns are placed alongside each other, but inside the PDF there is nothing that notes these facts.

If you look at the example below, getting it wrong could easily result in one line of text in the left column merging with a completely unrelated line of text from the right column -– not a hard mistake to make when you’re making your decision based on the amount of whitespace between text objects.
Line breaks in PDF

Recreating editable paragraphs and detecting them accurately is next

Detecting line ends correctly not only saves you from merging columns of content together, it does the even more important task of starting to rebuild the structure of the text content. Once you can see a series of horizontal lines you can start deducing where a paragraph with reflowing text might need to be –- and once you have that you can start re-creating a Microsoft Word file that is highly editable.

Of course, it’s never that simple (if it was I wouldn’t be writing about it!). Unfortunately, paragraph-based content can be presented in many different ways, which makes the accurate detection and reproduction even more difficult.  Examples include:

  • Different text styles and colours.
  • Drop caps at the beginning of chapters or sections.
  • Indents to multiply lines to indicate new (and different) paragraphs of content such as quotes.
  • First-line indents to indicate the start (and end) of paragraphs.
  • Changes in or maintaining the same alignment, such as left, right, centered and justified across lines can indicate separate paragraphs and content blocks.
  • Changes in line spacing, which can even show that the content was originally part of a separate paragraph and should therefore be treated differently.

Paragraphs in PDF

A few examples of the different kinds of paragraphs a document might contain.

Laying out the page with columns

Like examining the relationship and patterns between lines of text to re-discover paragraphs, we can start to figure out where columns might exist by looking at all the text and paragraphs on the page. For example, if we see a series of paragraphs, whose alignment on the left is all on the same vertical axis and each paragraph uses similar text styles and spacings, we may well have found ourselves a column of content.

For it to all come together well, we need to take a holistic approach and not assume too much before looking at all the page content. Once you think you have the overall text layout for the page, you need to take all the necessary page measurements so you can make use of them in Microsoft Word.

Using the Column settings in Word, you need to specify the number of columns, their widths and spacing between them, and then insert column- and line-breaks to ensure the text is placed in the right column and in the right area of the page.

Truly editable comes with advanced table detection

Tabulated content (i.e. tables) is similar to columns only more complex. You’re dealing with columns and rows, and varying degrees of information to discern tables accurately. Quality table detection is bordering on a black art as you each table you encounter is different, forcing you to have a large range of processes to run through before working out whether the content is a table or not.

If you look at the table below you can see how cells and tables can be formatted in different ways. There are more obvious signposts such as cell background coloring and borders to show it’s a table, but as you look to the bottom of the table you’ll see those indicators are gone.

Table text in PDF files

To get to that level of detection you have to run many processes to find a pattern and accurately identify the presence of a table.

Separating section/chapter-level content from page-level content

When determining the correct page margins to use when converting the PDF to Word, header and footer content often gets in the way and causes layout and editability problems. If you can detect this content and keep it separate, the normal page content is much more likely to lay out well.

You do this by scanning across multiple pages for similar content positioned in the same places at the bottom and/or top of the page, and when you find it keep it separate from the normal content. This advanced technique keeps the pages cleaner and allows you to incorporate this content into the actual header and footer areas of a Microsoft Word document.

The other ways to layout the text …

There are other, easier ways to produce an OK visual rendition of the page layout. PDF-to-Word converters sometimes do it using:

  • Line breaks. Instead of spending a lot of time working out whether lines above and below each other are part of the same paragraph, just force a line break at the end of every line.
  • Text boxes. Insert paragraphs of text into text boxes to get the position on the page the same as the original.
  • Tabs for page columns. In between columns just insert a tab to get the positioning approximately right.
  • Tabs for table columns. Again, in between each table column, insert a tab to position it.

Unfortunately, all these have an inherent problem because they only deal with the presentation and positioning of the content on the page. When it comes to making any changes to the text, the nightmare begins:

  • For line breaks you must manually remove each one to reflow text in formal paragraphs
  • For text boxes, you’re isolated from the rest of the on-page text and many formatting tools won’t work.
  • For tabs in columns and tables, you make text editing about as awkward as it can be as the text flow moves left to right across the page and ignores the fact that the content in some cells and columns originally flowed downwards over multiple lines.

These techniques are rarely the right answer.

Winning the battle between visual accuracy and editabiity

The biggest challenge when converting PDF to Word (and other formats) is to retain the visual appearance of the document, while adding back to it a meaningful structure that makes it possible to easily edit and re-purpose the content. A constant battle exists because techniques to improve visual accuracy can easily force the converter into creating less editable content, and vice versa. Moreover, no two documents are formatted and laid out exactly the same, meaning converters must be as flexible as possible.

Achieving 100% accurate PDF to Word conversion is an impossible feat. Instead, we must aspire to being as accurate as we possibly can, with as many possible documents. That’s where the biggest difference lies between the tools available to convert PDF to Word.

It is possible to create PDF files with embedded structure information in them, however most PDF files don’t have this structure.

Note for Nitro Pro users: If you’re a Nitro Pro user and have been waiting for us to improve our PDF to Word performance, you’ll be pleased to know that with the 6.0 release we’ll be including a whole new PDF-to-Word conversion engine. In fact, you can try it out now as our new free PDF to Word online service uses the same core technology. To get access to the beta, just enter nitro as your invite code.

addthis-custombutton PDF-to-Word Conversion: Why It’s So Hard to Do