Export Content from Confluence
Export multiple pages to HTML, XML, or PDF
If you want to export a space – or selected pages in a space – to HTML, XML, or PDF, Confluence can create a zipped archive of the HTML or XML files, or a single, downloadable PDF file.
PDF export is useful if, for example, you're producing a printable user manual from your documentation space. The HTML export can be used to convert your site content to a static website, and the XML export can be used to import your space content into another Confluence space (running the same or later version of Confluence).
To export pages to HTML, XML, or PDF:
- Go to the space and choose Space tools > Content Tools from the bottom of the sidebar
- Choose Export
- Select either HTML, XML, or PDF, then choose Next
- Select either a Normal or Custom export for HTML or PDF, or a Full or Custom export for XML:
  - Normal Export (HTML and PDF) – to produce an HTML or PDF file containing all the pages that you have permission to view.
  - Full Export (XML) – to produce an XML file containing all the pages in the space, including those that you do not have permission to view.
  - Custom Export – if you want to export selected pages only, or if you want to exclude comments from the export.
- Choose Export
When the export process has finished, you can download the zipped archive or PDF.
What's included in the export?
The following content is included in space exports.
Export type | Pages | Blogs | Comments | Attachments | Unpublished changes |
---|---|---|---|---|---|
HTML | Yes | No | Optional | Yes | No |
PDF | Yes | No | No | Images only | No |
XML | Yes | Yes | Optional | Yes | Yes |
Customizing the appearance of PDF exports
You can add a title page, table of contents, and customized headers and footers to the PDF output. For more advanced customizations, you can apply Cascading Style Sheet (CSS) modifications. These customizations are specific to each space, and you need the 'Space Administrator' permission to apply these customizations. For more information, see Customize Exports to PDF.
Notes on PDF exporting
- To export a PDF containing international text, see Create a PDF in Another Language
- Confluence's PDF export feature is designed to handle a wide variety of content, but on rare occasions the PDF export process may fail due to an unrecognized customization. If that happens, the PDF export screen shows the title of the page where the problem occurred, to help you diagnose the cause of the failure.
Notes on Word exporting
- Only the first 50 images attached to the page are exported to your Word document. This is to prevent out-of-memory errors affecting your whole Confluence site. See CONFSERVER-34211 (If a page with big number of images Exported to Word, some images are invisible) for more information, and to find out how you can temporarily increase this limit using a system property.
Notes on HTML exporting
- In the zip file, page attachments are placed in individual folders with names in the following format:
  ...\download\attachments\xxxxxx
  where 'xxxxxx' is the page ID of the page containing the attachments.
- To customize the HTML output, you'll need to modify the file confluence-x.y.z-jar/com/atlassian/confluence/pages/Page.htmlexport.vm. To learn how to repackage this file, see How to Edit Files in Confluence JAR Files.
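If you post-process an HTML export (for example, to build a static site), the attachment folder layout described above can be used to work out which files belong to which page. The following is a minimal sketch, not part of Confluence: the function name attachments_by_page is hypothetical, and it assumes that page IDs are numeric and that attachment files sit directly inside their page ID folder.

```python
import re
import zipfile
from collections import defaultdict

# Matches ".../download/attachments/<page id>/<file name>".
# Accepts both slash styles. Assumes the page ID is numeric.
ATTACHMENT_RE = re.compile(
    r"(?:^|[\\/])download[\\/]attachments[\\/](\d+)[\\/]([^\\/]+)$"
)


def attachments_by_page(zip_path):
    """Group attachment file names in an HTML export zip by page ID.

    zip_path may be a path or a binary file-like object.
    Returns a dict mapping page ID (str) to a list of file names.
    """
    grouped = defaultdict(list)
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            match = ATTACHMENT_RE.search(name)
            if match:
                page_id, file_name = match.groups()
                grouped[page_id].append(file_name)
    return dict(grouped)
```

Calling attachments_by_page on the downloaded archive returns a dictionary keyed by page ID. Entries that do not match the layout, such as the exported HTML pages themselves, are ignored.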