Edit

Share via


What is Azure Translator in Foundry Tools document translation?

Document translation is a cloud-based machine translation feature of Azure Translator. You can translate multiple and complex documents across all supported languages and dialects while preserving original document structure and data format. The Document translation API supports two translation processes:

  • Asynchronous batch translation supports the processing of multiple documents and large files. The batch translation process requires an Azure Blob storage account with storage containers for your source and translated documents.

  • Synchronous single file supports the processing of single file translations. The file translation process doesn't require an Azure Blob storage account. The final response contains the translated document and is returned directly to the calling client.

Key features

Feature Description
Translate large files Translate whole documents asynchronously.
Translate numerous files Translate multiple files across all supported languages and dialects while preserving document structure and data format.
Translate image file formats (preview) 🆕 Translate text within an image while maintaining the original design and layout.
Supported formats: .jpeg, .png, .bmp, .webp
Pricing: Calculated on a per image basis. For more information, see Pricing
Translate image text in Word document files (.docx). This feature is available with the batch document translation API for .docx file format.
Preserve source file presentation Translate files while preserving the original layout and format.
Apply custom translation Translate documents using general and custom translation models.
Apply custom glossaries Translate documents using custom glossaries.
Automatically detect document language Let the Document translation service determine the language of the document.
Translate documents with content in multiple languages Use the autodetect feature to translate documents with content in multiple languages into your target language.

Development options

Integrate document translation capabilities into your projects and applications by utilizing available development tools and options. This feature allows you to programmatically implement translation features in your application functionality and provide users with multilingual support as needed.

Optimize efficiency by utilizing asynchronous workflows to enable parallel translation of multiple documents and large files.

Development option Description
REST API The REST API is a language agnostic interface that enables you to create HTTP requests and authorization headers to translate documents.
Client libraries (SDKs) The client-library (SDKs) are language-specific classes, objects, methods, and code that you can quickly use by adding a reference in your project. Currently Document translation has programming language support for C#/.NET and Python.

Supported document and glossary formats

Batch document supported formats

The Get supported document formats method returns a list of document formats supported by the Document translation service. The list includes the common file extension, and the content-type if using the upload API.

File type File extension Description
Adobe PDF pdf Portable document file format. Document translation uses optical character recognition (OCR) technology to extract and translate text in scanned PDF document while retaining the original layout.
Comma-Separated Values csv A comma-delimited raw-data file used by spreadsheet programs.
HTML html, htm Hyper Text Markup Language.
Image (2025-12-01-preview) .jpeg, .png, .bmp, .webp Files that store digital image data.
OpenDocument Presentation odp An open-source presentation file.
OpenDocument Spreadsheet ods An open-source spreadsheet file.
OpenDocument Text odt An open-source text document file.
Markdown markdown, mdown, mkdn, md, mkd, mdwn, mdtxt, mdtext, rmd A lightweight markup language for creating formatted text.
M​HTML mhtml, mht A web page archive format used to combine HTML code and its companion resources.
Microsoft Excel xls, xlsx A spreadsheet file for data analysis and documentation.
Microsoft Outlook msg An email message created or saved within Microsoft Outlook.
Microsoft PowerPoint ppt, pptx A presentation file used to display content in a slideshow format.
Microsoft Word doc, docx A text document file.
Rich text format rtf A text document containing formatting.
Tab separated values/TAB tsv/tab A tab-delimited raw-data file used by spreadsheet programs.
Text txt An unformatted text document.
XML Localization Interchange File Format xlff A parallel document format, export of Translation Memory systems. The languages used are defined inside the file.

Batch Legacy file types

Source file types are preserved during the document translation with the following exceptions:

Source file extension Translated file extension
.doc, .odt, .rtf, .docx
.xls, .ods .xlsx
.ppt, .odp .pptx

Batch glossary supported formats

Document translation supports the following glossary file types:

File type File extension Description
Comma-Separated Values csv A comma-delimited raw-data file used by spreadsheet programs.
XML Localization Interchange File Format xlf , xliff A parallel document format, export of Translation Memory systems The languages used are defined inside the file.
Tab-Separated Values/TAB tsv, tab A tab-delimited raw-data file used by spreadsheet programs.

Document translation Request limits

For detailed information regarding Azure Translator request limits, see Document translation request limits.

Document translation data residency

Document translation data residency depends on the Azure region where your Translator resource was created:

✔️ Feature: Document translation
✔️ Service endpoint: Custom: <name-of-your-resource.cognitiveservices.azure.com/translator/text/batch/v1.1

Resource created region Request processing data center
Global Closest available data center.
Americas East US 2 • West US 2
Asia Pacific Japan East • Southeast Asia
Europe (except Switzerland) France Central • West Europe
Switzerland Switzerland North • Switzerland West

Next steps

In our quickstart, you learn how to rapidly get started using Document translation. To begin, you need an active Azure account. If you don't have one, you can create a free account.