Xliff
This gem is for parsing and building xliff files.
Usage
The gem is meant to handle two tasks – reading xliff files and creating new ones.
Reading xliff files
bundle = Xliff::Bundle.from_path('path/to/my/file.xliff')
bundle.files.each do |file|
puts "File: #{file.original}:"
file.entries.each do |entry|
puts "#{entry.source}:#{entry.target}"
end
endStrings that haven't been translated yet – the shape Xcode exports for a new locale – carry no <target>, so entry.target may be nil. A <note> is optional too, so entry.note may be nil.
Creating xliff files
bundle = Xliff::Bundle.new(path: 'path/to/my/file.xliff')
file = Xliff::File.new(original: 'info.plist', source_language: 'en', target_language: 'fr')
entry = Xliff::Entry.new(id: 1234, source: 'hello', target: 'bonjour')
file.add_entry(entry)
bundle.add_file(file)
xml = bundle.to_sIn the above example, xml reads:
<?xml version="1.0" encoding="UTF-8"?>
<xliff xmlns="urn:oasis:names:tc:xliff:document:1.2" version="1.2" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="urn:oasis:names:tc:xliff:document:1.2 http://docs.oasis-open.org/xliff/v1.2/os/xliff-core-1.2-transitional.xsd">
<file original="info.plist" source-language="en" target-language="fr" datatype="plaintext">
<body>
<trans-unit id="1234" xml:space="default">
<source>hello</source>
<target>bonjour</target>
</trans-unit>
</body>
</file>
</xliff>target is optional – omit it to build a source-only entry for a string that hasn't been translated yet, and no <target> element is written.
Conformance and limitations
Output targets XLIFF 1.2. Documents built from scratch validate against both the strict and transitional schemas, and round-tripped Xcode exports against the transitional schema – all verified by the test suite against the official OASIS XSDs (bundle exec rake conformance). The declared xsi:schemaLocation defaults to the transitional schema – which is what real-world content such as Xcode's <tool build-num> conforms to – and is preserved from the source document when round-tripping. For a document that uses only the structure and attributes the library models – as Xcode's exports do – reading and re-writing it is byte-for-byte unchanged. The library models a fixed attribute set per element and re-emits those attributes in a canonical order (see the limitations below); a document carrying other attributes – or the modeled ones in a different order – round-trips its content but not byte-for-byte.
A few things worth knowing:
-
Untranslated strings parse with a
niltarget(andnote); both elements are omitted on write. -
Xliff::Headermodels an element name and its attributes; a parsed header's child content is preserved verbatim. A header child's inner nodes – text and nested elements, e.g. an<skl>skeleton's<internal-file>– are deep-copied on parse (exposed asXliff::Header#child_nodes) and re-emitted on write, surviving a round-trip even though the library doesn't model them. As with<group>below, the content survives but isn't byte-for-byte: a moved node in a namespaced document may gain a redundantxmlns. A header built by hand carries only its element name and attributes. The class doesn't bind XML namespaces;xml:is the one prefix it can emit (it's always bound). On parse, a non-xml:prefix on an element name is dropped (the element re-homes into the default namespace), and a non-xml:prefix on an attribute name is dropped too – keeping it would emit an undeclared prefix (non-well-formed XML). Building a header by hand rejects a non-xml:prefix outright instead. See #18. -
Only modeled attributes are preserved.
<file>keepsoriginal/source-language/target-language/datatypeand<trans-unit>keepsid/xml:space; any other attribute – e.g.approved,state,product-name, orxml:langon<source>/<target>– is dropped on read and not re-emitted. Tracked in #28. -
source,target, andnoteare plain text. Inline XLIFF markup (<g>,<ph>, …) inside them is flattened to its text, and only the first<note>on a<trans-unit>is retained. -
<group>and<bin-unit>are preserved, not parsed. Their content survives a round-trip (and is available asXliff::File#unparsed_body_nodes), but the library doesn't model them or expose their nested<trans-unit>s as entries — and they're re-emitted after the file's entries rather than byte-for-byte in place. First-class support is tracked in #17. -
Attribute values aren't validated against the schema's enumerations – an out-of-range
datatype,xml:space, or language code is serialized as given. -
A preserved
xsi:schemaLocationis reported, not verified. The declaration is round-tripped verbatim from the source, so a document that over-declares – like Xcode's exports, which name the strict schema yet carry the transitional-only<tool build-num>– re-emits output that declares a schema it doesn't fully satisfy. (Re-deriving the declaration instead would break the byte-identical round-trip.) Documents built from scratch default to the transitional schema precisely to avoid this. -
Byte-identical round-trip assumes a standard root. The
<xliff>element is always re-emitted withxmlns,xmlns:xsi,version, andxsi:schemaLocation; a source that omits any of these, or declares the schema-instance namespace under a non-xsiprefix, is normalized to that form. Preserving an arbitrary root verbatim is tracked in #16. -
Namespace handling targets the standard XLIFF namespace. A source document not in the XLIFF 1.2 namespace still round-trips its content, but a preserved
<group>/<bin-unit>or header child may shift itsxmlnson write – and such a round-trip can take a second pass to stabilize. The standaloneXliff::Header#to_s/Xliff::File#to_slikewise emit a namespace-coherent fragment only within the full<xliff>document. Deliberately out of scope here, since Xcode's exports are all in the standard namespace – tracked in #32.
Development
After checking out the repo, run bundle install to install dependencies. Then, run bundle exec rake spec to run the tests; bundle exec rake conformance runs just the schema-conformance examples, validating serialized output against the vendored OASIS XLIFF 1.2 XSDs in spec/schemas. You can also run bundle exec console for an interactive prompt that will allow you to experiment.
Contributing
Bug reports and pull requests are welcome on GitHub at https://github.com/automattic/xliff. This project is intended to be a safe, welcoming space for collaboration, and contributors are expected to adhere to the code of conduct.
License
The gem is available as open source under the terms of the MIT License.
Code of Conduct
Everyone interacting in the Xliff project's codebase and issue tracker is expected to follow the code of conduct.