โฏ๏ธ Json::Merge
[![License: MIT][๐license-img]][๐license-ref]
if ci_badges.map(&:color).detect { it != "green"} โ๏ธ let me know, as I may have missed the discord notification.
if ci_badges.map(&:color).all? { it == "green"} ๐๏ธ send money so I can do more of this. FLOSS maintenance is now my full-time job.
I've summarized my thoughts in this blog post.
๐ป Synopsis
Json::Merge is a standalone Ruby module that intelligently merges two versions of a JSON or JSONC file using tree-sitter AST analysis. It's like a smart "git merge" specifically designed for JSON configuration files. Built on top of ast-merge, it shares the same architecture as prism-merge for Ruby source files.
When the underlying tree-sitter JSON parser surfaces JSONC comments, json-merge now preserves them directly. The jsonc-merge gem remains available as a compatibility shim for older integrations that still depend on that gem name.
Key Features
- Tree-Sitter Powered: Uses tree-sitter-json for accurate AST parsing
-
JSONC-Aware: Preserves
//and/* */comments when the parser exposes them - Intelligent: Matches objects and arrays by structural signatures
-
Fuzzy Property Matching:
ObjectMatchRefinermatches similar property names (e.g.,databaseUrlโdatabase_url) using Levenshtein distance for naming convention differences - Full Provenance: Tracks origin of every node
-
Standalone: Minimal dependencies - just
ast-mergeandruby_tree_sitter -
Customizable:
-
signature_generator- callable custom signature generators -
preference- setting of:template,:destination, or a Hash for per-node-type preferences -
node_splitter- Hash mapping node types to callables for per-node-type merge customization (see ast-merge docs) -
add_template_only_nodes- setting to retain nodes that do not exist in destination -
match_refiners- array of refiners for fuzzy matching (e.g.,ObjectMatchRefiner)
-
Supported Node Types
| Node Type | Signature Format | Matching Behavior |
|---|---|---|
| Object | [:object, key_signatures...] |
Objects match by their key structure |
| Array | [:array, element_count] |
Arrays match by position and type |
| Pair | [:pair, key_name] |
Key-value pairs match by key name |
| String | [:string, value] |
Strings match by value |
| Number | [:number, value] |
Numbers match by value |
| Boolean | [:boolean, value] |
Booleans match by value |
| Null | [:null] |
Null values always match |
Example
require "json/merge"
template = File.read("template.json")
destination = File.read("destination.json")
merger = Json::Merge::SmartMerger.new(template, destination)
result = merger.merge
File.write("merged.json", result.to_json)The *-merge Gem Family
The *-merge gem family provides intelligent, AST-based merging for various file formats. At the foundation is tree_haver, which provides a unified cross-Ruby parsing API that works seamlessly across MRI, JRuby, and TruffleRuby.
| Gem | Version / CI | Language / Format |
Parser Backend(s) | Description |
|---|---|---|---|---|
| tree_haver |
|
Multi | Supported Backends: MRI C, Rust, FFI, Java, Prism, Psych, Commonmarker, Markly, Citrus, Parslet | Foundation: Cross-Ruby adapter for parsing libraries (like Faraday for HTTP) |
| ast-merge |
|
Text | internal |
Infrastructure: Shared base classes and merge logic for all *-merge gems |
| bash-merge |
|
Bash | tree-sitter-bash (via tree_haver) | Smart merge for Bash scripts |
| commonmarker-merge |
|
Markdown | Commonmarker (via tree_haver) | Smart merge for Markdown (CommonMark via comrak Rust) |
| dotenv-merge |
|
Dotenv | internal | Smart merge for .env files |
| json-merge |
|
JSON / JSONC | tree-sitter-json (via tree_haver) | Smart merge for JSON and JSONC files |
| jsonc-merge |
|
JSONC | compatibility shim | Compatibility wrapper that depends on json-merge
|
| markdown-merge |
|
Markdown | Commonmarker / Markly (via tree_haver), Parslet | Foundation: Shared base for Markdown mergers with inner code block merging |
| markly-merge |
|
Markdown | Markly (via tree_haver) | Smart merge for Markdown (CommonMark via cmark-gfm C) |
| prism-merge |
|
Ruby |
Prism (prism std lib gem) |
Smart merge for Ruby source files |
| psych-merge |
|
YAML |
Psych (psych std lib gem) |
Smart merge for YAML files |
| rbs-merge |
|
RBS |
tree-sitter-rbs (via tree_haver), RBS (rbs std lib gem) |
Smart merge for Ruby type signatures |
| toml-merge |
|
TOML | Parslet + toml, Citrus + toml-rb, tree-sitter-toml (all via tree_haver) | Smart merge for TOML files |
Backend Platform Compatibility
tree_haver supports multiple parsing backends, but not all backends work on all Ruby platforms:
| Platform ๐๏ธ TreeHaver Backend ๐๏ธ |
MRI | JRuby | TruffleRuby | Notes |
|---|---|---|---|---|
| MRI (ruby_tree_sitter) | โ | โ | โ | C extension, MRI only |
| Rust (tree_stump) | โ | โ | โ | Rust extension via magnus/rb-sys, MRI only |
| FFI (ffi) | โ | โ | โ | TruffleRuby's FFI doesn't support STRUCT_BY_VALUE
|
| Java (jtreesitter) | โ | โ | โ | JRuby only, requires grammar JARs |
| Prism (prism) | โ | โ | โ | Ruby parsing, stdlib in Ruby 3.4+ |
| Psych (psych) | โ | โ | โ | YAML parsing, stdlib |
| Citrus (citrus) | โ | โ | โ | Pure Ruby PEG parser, no native dependencies |
| Parslet (parslet) | โ | โ | โ | Pure Ruby PEG parser, no native dependencies |
| Commonmarker (commonmarker) | โ | โ | โ | Rust extension for Markdown (via commonmarker-merge) |
| Markly (markly) | โ | โ | โ | C extension for Markdown (via markly-merge) |
Legend: โ = Works, โ = Does not work, โ = Untested
Why some backends don't work on certain platforms:
-
JRuby: Runs on the JVM; cannot load native C/Rust extensions (
.sofiles) -
TruffleRuby: Has C API emulation via Sulong/LLVM, but it doesn't expose all MRI internals that native extensions require (e.g.,
RBasic.flags,rb_gc_writebarrier) - FFI on TruffleRuby: TruffleRuby's FFI implementation doesn't support returning structs by value, which tree-sitter's C API requires
Example implementations for the gem templating use case:
| Gem | Purpose | Description |
|---|---|---|
| kettle-dev | Gem Development | Gem templating tool using *-merge gems |
| kettle-jem | Gem Templating | Gem template library with smart merge support |
๐ก Info you can shake a stick at
| Tokens to Remember |
|
|---|---|
| Works with JRuby |
|
| Works with Truffle Ruby |
|
| Works with MRI Ruby 4 |
|
| Works with MRI Ruby 3 |
|
| Support & Community |
|
| Source |
|
| Documentation |
|
| Compliance | [![License: MIT][๐license-img]][๐license-ref] |
| Style |
|
| Maintainer ๐๏ธ |
|
... ๐ |
|
Compatibility
Compatible with MRI Ruby 3.2.0+, and concordant releases of JRuby, and TruffleRuby.
| ๐ Amazing test matrix was brought to you by | ๐ appraisal2 ๐ and the color ๐ green ๐ |
|---|---|
| ๐ Check it out! | โจ github.com/appraisal-rb/appraisal2 โจ |
Federated DVCS
| Federated DVCS Repository | Status | Issues | PRs | Wiki | CI | Discussions |
|---|---|---|---|---|---|---|
| ๐งช kettle-rb/json-merge on GitLab | The Truth | ๐ | ๐ | ๐ | ๐ญ Tiny Matrix | โ |
| ๐ง kettle-rb/json-merge on CodeBerg | An Ethical Mirror (Donate) | ๐ | ๐ | โ | โญ๏ธ No Matrix | โ |
| ๐ kettle-rb/json-merge on GitHub | Another Mirror | ๐ | ๐ | ๐ | ๐ฏ Full Matrix | ๐ |
| ๐ฎ๏ธ Discord Server | Let's | talk | about | this | library! |
Available as part of the Tidelift Subscription.
The maintainers of this and thousands of other packages are working with Tidelift to deliver commercial support and maintenance for the open source packages you use to build your applications. Save time, reduce risk, and improve code health, while paying the maintainers of the exact packages you use.
- ๐กSubscribe for support guarantees covering all your FLOSS dependencies
- ๐กTidelift is part of Sonar
- ๐กTidelift pays maintainers to maintain the software you depend on!
๐@Pointy Haired Boss: An enterprise support subscription is "never gonna let you down", and supports open source maintainers
Alternatively:
โจ Installation
Install the gem and add to the application's Gemfile by executing:
bundle add json-mergeIf bundler is not being used to manage dependencies, install the gem by executing:
gem install json-merge๐ Secure Installation
This gem is cryptographically signed and has verifiable SHA-256 and SHA-512 checksums by stone_checksums. Be sure the gem you install hasnโt been tampered with by following the instructions below.
Add my public key (if you havenโt already; key expires 2045-04-29) as a trusted certificate:
gem cert --add <(curl -Ls https://raw.github.com/galtzo-floss/certs/main/pboling.pem)You only need to do that once. Then proceed to install with:
gem install json-merge -P HighSecurityThe HighSecurity trust profile will verify signed gems, and not allow the installation of unsigned dependencies.
If you want to up your security game full-time:
bundle config set --global trust-policy MediumSecurityMediumSecurity instead of HighSecurity is necessary if not all the gems you use are signed.
NOTE: Be prepared to track down certs for signed gems and add them the same way you added mine.
โ๏ธ Configuration
Signature Match Preference
Control which version to use when nodes have matching signatures but different content:
# Use template version (for config updates)
merger = Json::Merge::SmartMerger.new(
template,
destination,
preference: :template,
)
# Use destination version (default - preserve customizations)
merger = Json::Merge::SmartMerger.new(
template,
destination,
preference: :destination,
)Template-Only Nodes
Control whether to add nodes that only exist in the template:
# Add template-only nodes
merger = Json::Merge::SmartMerger.new(
template,
destination,
add_template_only_nodes: true,
)Object Match Refiner
When JSON object properties (key-value pairs) don't match by exact key name, the
ObjectMatchRefiner uses fuzzy matching to pair entries with:
- Similar key names (e.g.,
databaseUrlvsdatabase_url) - Keys with typos or different naming conventions (camelCase vs snake_case)
- Array elements with similar structure or content
# Enable object fuzzy matching
merger = Json::Merge::SmartMerger.new(
template,
destination,
match_refiners: [
Json::Merge::ObjectMatchRefiner.new(threshold: 0.5),
],
)ObjectMatchRefiner Options
| Option | Default | Description |
|---|---|---|
threshold |
0.5 | Minimum similarity score (0.0-1.0) to accept a match |
key_weight |
0.7 | Weight for key name similarity |
value_weight |
0.3 | Weight for value similarity |
# Custom weights for key-centric matching
refiner = Json::Merge::ObjectMatchRefiner.new(
threshold: 0.6,
key_weight: 0.8, # Focus more on key names
value_weight: 0.2, # Less focus on values
)Debug Logging
Enable debug logging to see merge decisions:
export JSON_MERGE_DEBUG=1๐ง Basic Usage
Merging Two JSON Files
require "json/merge"
template_content = File.read("template.json")
dest_content = File.read("destination.json")
merger = Json::Merge::SmartMerger.new(template_content, dest_content)
result = merger.merge
File.write("merged.json", result.to_json)Analyzing a JSON File
require "json/merge"
source = File.read("config.json")
analysis = Json::Merge::FileAnalysis.new(source)
# Iterate over all top-level nodes
analysis.statements.each do |node|
sig = analysis.generate_signature(node)
puts "#{node.class}: #{sig.inspect}"
endFuzzy Property Matching
When property names differ between template and destination (e.g., naming convention changes),
use the ObjectMatchRefiner:
require "json/merge"
template = <<~JSON
{
"databaseUrl": "postgres://localhost/app",
"cacheTimeout": 3600,
"apiEndpoint": "https://api.example.com"
}
JSON
destination = <<~JSON
{
"database_url": "postgres://localhost/custom",
"cache_ttl": 7200,
"api_endpoint": "https://custom.example.com"
}
JSON
# Default merge won't match keys (names differ - camelCase vs snake_case)
# Use ObjectMatchRefiner for fuzzy matching
merger = Json::Merge::SmartMerger.new(
template,
destination,
match_refiners: [
Json::Merge::ObjectMatchRefiner.new(threshold: 0.5),
],
)
result = merger.merge
# Properties are matched despite naming convention differences:
# - databaseUrl โ database_url (similar when normalized)
# - cacheTimeout โ cache_ttl (similar: "cache")
# - apiEndpoint โ api_endpoint (similar when normalized)Array Element Matching
The ObjectMatchRefiner also handles array elements with similar structure:
template = <<~JSON
{
"users": [
{ "id": 1, "userName": "alice" },
{ "id": 2, "userName": "bob" }
]
}
JSON
destination = <<~JSON
{
"users": [
{ "id": 1, "user_name": "alice_custom" },
{ "id": 3, "user_name": "charlie" }
]
}
JSON
merger = Json::Merge::SmartMerger.new(
template,
destination,
match_refiners: [
Json::Merge::ObjectMatchRefiner.new(threshold: 0.5),
],
)
# Array elements with matching IDs or similar structure are paired๐ฆท FLOSS Funding
While kettle-rb tools are free software and will always be, the project would benefit immensely from some funding. Raising a monthly budget of... "dollars" would make the project more sustainable.
We welcome both individual and corporate sponsors! We also offer a wide array of funding channels to account for your preferences (although currently Open Collective is our preferred funding platform).
If you're working in a company that's making significant use of kettle-rb tools we'd appreciate it if you suggest to your company to become a kettle-rb sponsor.
You can support the development of kettle-rb tools via GitHub Sponsors, Liberapay, PayPal, Open Collective and Tidelift.
| ๐ NOTE |
|---|
| If doing a sponsorship in the form of donation is problematic for your company from an accounting standpoint, we'd recommend the use of Tidelift, where you can get a support-like subscription instead. |
Open Collective for Individuals
Support us with a monthly donation and help us continue our activities. [Become a backer]
NOTE: kettle-readme-backers updates this list every day, automatically.
No backers yet. Be the first!
Open Collective for Organizations
Become a sponsor and get your logo on our README on GitHub with a link to your site. [Become a sponsor]
NOTE: kettle-readme-backers updates this list every day, automatically.
No sponsors yet. Be the first!
Another way to support open-source
Iโm driven by a passion to foster a thriving open-source community โ a space where people can tackle complex problems, no matter how small. Revitalizing libraries that have fallen into disrepair, and building new libraries focused on solving real-world challenges, are my passions. I was recently affected by layoffs, and the tech jobs market is unwelcoming. Iโm reaching out here because your support would significantly aid my efforts to provide for my family, and my farm (11 ๐ chickens, 2 ๐ถ dogs, 3 ๐ฐ rabbits, 8 ๐โ cats).
If you work at a company that uses my work, please encourage them to support me as a corporate sponsor. My work on gems you use might show up in bundle fund.
Iโm developing a new library, floss_funding, designed to empower open-source developers like myself to get paid for the work we do, in a sustainable way. Please give it a look.
Floss-Funding.dev: ๐๏ธ No network calls. ๐๏ธ No tracking. ๐๏ธ No oversight. ๐๏ธ Minimal crypto hashing. ๐ก Easily disabled nags
๐ Security
See SECURITY.md.
๐ค Contributing
If you need some ideas of where to help, you could work on adding more code coverage, or if it is already ๐ฏ (see below) check reek, issues, or PRs, or use the gem and think about how it could be better.
We so if you make changes, remember to update it.
See CONTRIBUTING.md for more detailed instructions.
๐ Release Instructions
See CONTRIBUTING.md.
Code Coverage
๐ช Code of Conduct
Everyone interacting with this project's codebases, issue trackers,
chat rooms and mailing lists agrees to follow the .
๐ Contributors
Made with contributors-img.
Also see GitLab Contributors: https://gitlab.com/kettle-rb/json-merge/-/graphs/main
๐ Versioning
This Library adheres to .
Violations of this scheme should be reported as bugs.
Specifically, if a minor or patch version is released that breaks backward compatibility,
a new version should be immediately released that restores compatibility.
Breaking changes to the public API will only be introduced with new major versions.
dropping support for a platform is both obviously and objectively a breaking change
โJordan Harband (@ljharb, maintainer of SemVer) in SemVer issue 716
I understand that policy doesn't work universally ("exceptions to every rule!"), but it is the policy here. As such, in many cases it is good to specify a dependency on this library using the Pessimistic Version Constraint with two digits of precision.
For example:
spec.add_dependency("json-merge", "~> 2.0")SemVer should, IMO, but doesn't explicitly, say that dropping support for specific Platforms is a breaking change to an API, and for that reason the bike shedding is endless.
To get a better understanding of how SemVer is intended to work over a project's lifetime, read this article from the creator of SemVer:
See CHANGELOG.md for a list of releases.
๐ License
The gem is available under the following license: AGPL-3.0-only. See LICENSE.md for details.
If none of the available licenses suit your use case, please contact us to discuss a custom commercial license.
ยฉ Copyright
See LICENSE.md for the official copyright notice.
๐ค A request for help
Maintainers have teeth and need to pay their dentists. After getting laid off in an RIF in March, and encountering difficulty finding a new one, I began spending most of my time building open source tools. I'm hoping to be able to pay for my kids' health insurance this month, so if you value the work I am doing, I need your support. Please consider sponsoring me or the project.
To join the community or get help ๐๏ธ Join the Discord.
To say "thanks!" โ๏ธ Join the Discord or ๐๏ธ send money.
Please give the project a star โญ โฅ.
Thanks for RTFM. โบ๏ธ