Text tab

When exporting your PDF file (File | Export | Pages as), click on the Format button and switch to the Text tab.

If you have selected the Text Output option on the General tab, you can specify here how text is exported. These settings apply to all text formats.

Note: Additional options that apply only to HTML exports are on the HTML tab.

For a better overview, the options of this tab are divided into two sections: Text Flow and Text Appearance.

Text Flow section

Identify image captions

This option will cause FlexiPDF to identify image captions and output them together with the images instead of in the body text. If you have enabled image output, the captions for TIFF, JPEG, and PNG files will also be embedded in the image files themselves.

Try to preserve layout

In all text formats (see Text output) except EPUB, this option causes FlexiPDF to position words as close as possible to the original layout. The way in which this is done depends on the selected output format.
In the "Plain Text" format, the layout is approximated by inserting spaces. For this to work, you should view the output document in a monospaced font (for example, Courier New) with automatic line wrapping turned off. The "Simple HTML" format also simulates the layout by inserting spaces.
The "HTML 3" and "HTML 4" formats use HTML tables for positioning. "HTML 4 CSS" uses the exact text positioning that was introduced with CSS2.
Certain deviations from the original layout are inevitable. You may still need to edit the output files a bit.
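
The following Python sketch illustrates the general idea of approximating a layout with spaces in plain text. It is not FlexiPDF's actual code: the function name render_plain_text, the (x, y, text) input format, and the cell sizes char_width and line_height are all assumptions made for this example.

```python
def render_plain_text(words, char_width=6.0, line_height=12.0):
    """Place (x, y, text) words on a character grid using spaces.

    x and y are page coordinates in points, with y growing downward.
    char_width and line_height are assumed cell sizes for this sketch.
    """
    if not words:
        return ""

    # Group the words by the text row they fall into.
    rows = {}
    for x, y, text in words:
        row = round(y / line_height)
        col = round(x / char_width)
        rows.setdefault(row, []).append((col, text))

    lines = []
    for row in range(min(rows), max(rows) + 1):
        line = ""
        for col, text in sorted(rows.get(row, [])):
            # Keep at least one space between neighbouring words,
            # even if rounding would make them overlap.
            if line and col <= len(line):
                col = len(line) + 1
            # Pad with spaces up to the target column.
            line = line.ljust(col) + text
        lines.append(line)
    return "\n".join(lines)


words = [(0, 0, "Name"), (120, 0, "Price"), (0, 12, "Apple"), (120, 12, "1.20")]
print(render_plain_text(words))
```

Because every column is expressed as a number of characters, the columns line up only in a monospaced font, which is why the output should be viewed that way.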

Render text in vector artwork

If this option is enabled, the letters will not be output directly but will be broken down into graphic curves.

Recognise "speech marks"

If the option Preserve line breaks is disabled, FlexiPDF will try to reflow the text into paragraphs. This works very well in most cases.
If you are exporting a document that contains direct speech (for example, the dialogue in a novel), however, it is important to preserve the line breaks of the original document. This is because a line break indicates to the reader that the speaker has changed.
If you have enabled the option Recognise "speech marks", FlexiPDF will pay special attention to quotes at the start of a line and at the start of a paragraph and will thus be able to preserve these necessary line breaks.
The option should remain disabled for documents that do not contain direct speech.
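
As a rough illustration of the behavior described above, the following Python sketch reflows lines into paragraphs and, optionally, starts a new paragraph at every line that begins with an opening quote. It is a simplified assumption of how such a rule could work, not FlexiPDF's actual algorithm; the names reflow and OPENING_QUOTES are hypothetical.

```python
# Characters treated as opening speech marks in this sketch.
OPENING_QUOTES = ('"', "\u201c", "\u2018", "\u00ab")


def reflow(lines, recognise_speech_marks=False):
    """Join lines into paragraphs; blank lines separate paragraphs.

    With recognise_speech_marks=True, a line that starts with an opening
    quote begins a new paragraph, so each speaker keeps a separate line.
    """
    paragraphs = []
    current = []
    for line in lines:
        stripped = line.strip()
        starts_speech = recognise_speech_marks and stripped.startswith(OPENING_QUOTES)
        # Close the running paragraph at a blank line or at new speech.
        if (not stripped or starts_speech) and current:
            paragraphs.append(" ".join(current))
            current = []
        if stripped:
            current.append(stripped)
    if current:
        paragraphs.append(" ".join(current))
    return paragraphs


dialogue = [
    '"Where are you going?"',
    '"To the station," she said, and',
    "picked up her bag.",
    '"Wait for me."',
]
for paragraph in reflow(dialogue, recognise_speech_marks=True):
    print(paragraph)
```

With the flag disabled, the same four lines are merged into a single paragraph, and the change of speaker is no longer visible.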

Preserve line breaks

This option enables you to ensure that FlexiPDF retains all line breaks of the original document. Furthermore, it will stop FlexiPDF from removing hyphenation.
This option makes it more difficult to subsequently edit the output document because lines of text will not reflow after insertions and deletions. The advantage of the option is that the layout is closer to the original layout.

De-hyphenate words

With this option, FlexiPDF removes hyphens at the end of a line and rejoins the parts of words separated by hyphenation into one word. Words with normal hyphens such as "lower-level" are not changed.
FlexiPDF removes the hyphens using the spell check dictionary that is currently active. (For more information, see Spell check options.) It is thus important that the language selected for the spell check matches the language of the document.
If this option is disabled, FlexiPDF will leave hyphens as they appear in the original document.
If either of the above options Try to preserve layout or Preserve line breaks is enabled, the De-hyphenate words option is not available.
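
The following Python sketch shows one way dictionary-based de-hyphenation can work in principle. How FlexiPDF actually consults the spell check dictionary is not documented here, so the rule used below (remove the hyphen only if the joined word is in the dictionary) is an assumption, as are the function name dehyphenate and the small word set that stands in for the dictionary.

```python
def dehyphenate(lines, dictionary):
    """Rejoin words that were split by a hyphen at the end of a line.

    `dictionary` is a set of lowercase words standing in for the active
    spell check dictionary (an assumption for this sketch).
    """
    out = []
    carry = ""  # first half of a split word, still ending in "-"
    for line in lines:
        words = line.split()
        if carry and words:
            joined = carry[:-1] + words[0]  # candidate without the hyphen
            if joined.strip('.,;:!?"').lower() in dictionary:
                words[0] = joined            # "exam-" + "ple" -> "example"
            else:
                words[0] = carry + words[0]  # "lower-" + "level" stays hyphenated
            carry = ""
        # Hold back a trailing word fragment until the next line is read.
        if words and words[-1].endswith("-") and len(words[-1]) > 1:
            carry = words.pop()
        out.append(" ".join(words))
    if carry:
        out.append(carry)  # nothing followed the last fragment
    return out


dictionary = {"example", "lower", "level", "text"}
lines = ["This is an exam-", "ple of lower-", "level text."]
print(dehyphenate(lines, dictionary))
```

If the dictionary belongs to a different language than the document, the lookup fails for most words and the hyphens stay in place, which is why the two languages need to match.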

Text Appearance section

Create file for every page/article

This option will cause FlexiPDF to output each page to a separate file. When you export to HTML, the pages are linked to each other via hyperlinks. This option is ignored for EPUB exports because an EPUB is always output as a single file.
If you export article threads, the separate files will be created per article and not page by page.

Hyperlinks

If FlexiPDF finds hyperlinks in the PDF document, they will also be written to the output file. This option has no effect on output in "Plain Text" format (see Text output) because plain text cannot contain hyperlinks.

Bookmarks

If this option is enabled, the bookmarks of the PDF file will be stored as hypertext destinations in the output file. This option has no effect on output in "Plain Text" format.

Page breaks

If this option is enabled, FlexiPDF will insert a page break at the end of each page. In HTML output, the HTML tag <HR> is used for this purpose, which inserts a horizontal separator.

Use UTF-8 encoding

If this option is enabled, the text is output in UTF-8. This is an encoding of the Unicode character set and therefore also supports complex scripts (for example, Asian scripts).
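
The short Python example below illustrates why UTF-8 matters for such scripts. It is independent of FlexiPDF and uses only the standard library; the helper name can_encode is chosen for this example.

```python
def can_encode(text, encoding):
    """Return True if every character of text exists in the encoding."""
    try:
        text.encode(encoding)
        return True
    except UnicodeEncodeError:
        return False


sample = "PDF 日本語"

# UTF-8 can represent every Unicode character; a single-byte
# encoding such as Latin-1 cannot represent Japanese characters.
print(can_encode(sample, "utf-8"))    # True
print(can_encode(sample, "latin-1"))  # False

# ASCII characters take one byte each in UTF-8; each of the three
# Japanese characters here takes three bytes.
print(len(sample.encode("utf-8")))    # 13
```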