ChimeraHttpClient
When starting to split monolithic apps into smaller services, you need an easy way to access the remote data from the other apps. This chimera_http_client gem should serve as a comfortable and unifying way to access endpoints from other apps.
And what works for the internal communication between your own apps, will also work for external APIs that do not offer a client for simplified access.
It offers an easy to learn interface and nice error handling. And it enables you to queue HTTP requests to run them in parallel for better performance and simple aggregating of distributed data.
Dependencies
The chimera_http_client gem is wrapping the libcurl wrapper Typhoeus to have a more convenient interface. This allows for fast requests, for caching responses, and for queueing requests to run them in parallel. Connections are persistent by default, which saves subsequent requests from establishing a connection.
The only other runtime dependency is Ruby's latest code loader zeitwerk which is also part of Rails 6. ^
Ruby version
| Chimera version | MRI Ruby version | JRuby | TruffleRuby |
|---|---|---|---|
| >= 1.7 | >= 3.3 (older versions untested, likely still work) | yes | yes |
| >= 1.6 | >= 2.7 (all 3.x versions supported) | yes | yes |
| >= 1.4 | >= 2.5 (3.0 compatibility ensured) | yes | no |
| >= 1.1 | >= 2.5 | ? | ? |
| = 1.0 | >= 2.4, <= 3.0 | ? | ? |
| <= 0.5 | >= 2.1, <= 3.0 | ? | ? |
The test suite of v1.4 passes on MRI Ruby (2.5, 2.6, 2.7, 3.0) and on JRuby, but not on TruffleRuby.
The test suite of v1.6 passes on MRI Ruby (2.7, 3.0, 3.1, 3.2, 3.3) and on JRuby and TruffleRuby.
The non-MRI Rubys are not part of the regular Matrix, as their CI jobs take 3x as long, but included for new releases.
ENV variables
Setting the environment variable ENV['CHIMERA_HTTP_CLIENT_LOG_REQUESTS'] to true (or 'true') will provide more detailed error messages for logging and also add additional information to the Error JSON. It is recommended to use this only in development environments.
Table of Contents
- ChimeraHttpClient
- Dependencies
- Ruby version
- ENV variables
- Table of Contents
- The Connection class
- Initialization
- Mandatory initialization parameter
base_url - Optional initialization parameters
- Custom deserializers
- Monitoring, metrics, instrumentation
- Mandatory initialization parameter
- Request methods
- Mandatory request parameter
endpoint - Optional request parameters
- Basic auth
- Timeout duration
- Custom logger
- Caching responses
- Mandatory request parameter
- Example usage
- Initialization
- The Request class
- The Response class
- Error classes
- The Queue class
- Queueing requests
- Executing requests in parallel
- Empty the queue
- Installation
- Maintainers and Contributors
- Roadmap
- Chimera
- Dependencies
The Connection class
The basic usage looks like this:
connection = ChimeraHttpClient::Connection.new(base_url: 'http://localhost/namespace')
response = connection.get!(endpoint, params: params)Initialization
connection = ChimeraHttpClient::Connection.new(base_url: 'http://localhost:3000/v1', logger: logger, cache: cache)
Mandatory initialization parameter base_url
The mandatory parameter is base_url which should include the host, port and base path to the API endpoints you want to call, e.g. 'http://localhost:3000/v1'.
Setting the base_url is meant to be a comfort feature, as you can then pass short endpoints to each request like /users. You could set an empty string '' as base_url and then pass full qualified URLs as endpoint of the requests.
Optional initialization parameters
The optional parameters are:
-
cache- an instance of your cache solution, can be overwritten in any request -
deserializers- custom methods to deserialize the response body, below more details -
logger- an instance of a logger class that implements#info,#warnand#errormethods -
monitor- to collect metrics about requests, the basis for your instrumentation needs -
timeout- the timeout for all requests, can be overwritten in any request, the default are 3 seconds -
user_agent- if you would like your calls to identify with a specific user agent -
verbose- the default isfalse, set it to true while debugging issues
Custom deserializers
In case the API you are connecting to does not return JSON, you can pass custom deserializers to Connection.new or Queue.new:
deserializers: { error: your_error_deserializer, response: your_response_deserializer }
A Deserializer has to be an object on which the method call with the parameter body can be called:
custom_deserializer.call(body)
where body is the response body (in the default case a JSON object). The class Deserializer contains the default objects that are used. They might help you creating your own. Don't forget to make requests with another header than the default "Content-Type" => "application/json", when the API you connect to does not support JSON.
Monitoring, metrics, instrumentation
Pass an object as :monitor to a connection that defines the method call and accepts a hash as parameter.
monitor.call({...})
It will receive information about every request as soon as it finished. What you do with this information is up for you to implement.
| Field | Description |
|---|---|
url |
URL of the endpoint that was called |
method |
HTTP method: get, post, ... |
status |
HTTP status code: 200, ... |
runtime |
the time in seconds it took the request to finish |
completed_at |
Time.now.utc.iso8601(3) |
context |
Whatever you pass as monitoring_context to the options of a request |
Request methods
The available methods are:
-
get/get! -
post/post! -
put/put -
patch/patch! -
delete/delete!
where the methods ending on a bang! will raise an error (which you should handle in your application) while the others will return an error object.
Mandatory request parameter endpoint
The base_url set in the connection will together with the endpoint determine the URL to make a request to.
connection.get([:users, id])
connection.get(["users", id])
connection.get("users/#{id}")
connection.get("/users/#{id}")All forms above ave valid and will make a request to the same URL.
- Please take note that the endpoint can be given as a String, a Symbol, or an Array.
- While they do no harm, there is no need to pass leading or trailing
/in endpoints. - When passing the endpoint as an Array, it's elements are converted to Strings and concatenated with
/.
Optional request parameters
All request methods expect a mandatory endpoint and an optional hash as parameters. In the latter the following keywords are treated specially:
-
body- the mandatory body of apost,putorpatchrequest -
headers- a hash of HTTP headers -
params- parameters of a HTTP request -
username- used for a BasicAuth login -
password- used for a BasicAuth login -
timeout- set a custom timeout per request (the default is 3 seconds) -
cache- optionally overwrite the cache store set inConnectionin any request -
monitoring_context- pass additional information you want to collect with your instrumentationmonitor
Example:
connection.post(
:users,
body: { name: "Andy" },
params: { origin: `Twitter`},
headers: { "Authorization" => "Bearer #{token}" },
timeout: 10,
cache: nil
)Basic auth
In case you need to use an API that is protected by basic_auth just pass the credentials as optional parameters:
username: 'admin', password: 'secret'
Timeout duration
The default timeout duration is 3 seconds.
If you want to use a different timeout, you can pass the key timeout when initializing the Connection. You can also overwrite it on every call.
Custom logger
By default no logging is happening. If you need request logging, you can pass your custom Logger to the key logger when initializing the Connection. It will write to logger.info when starting and when completing a request.
The message passed to the logger is a hash with the following fields:
| Key | Description |
|---|---|
message |
indicator if a call was started or finished |
method |
the HTTP method used |
url |
the requested URL |
code |
HTTP status code |
runtime |
time the request took in ms |
user_agent |
the user_agent used to open the connection |
Caching responses
To cache all the reponses of a connection, just pass the optional parameter cache to its initializer. You can also overwrite the connection's cache configuration by passing the parameter cache to any get call.
It could be an instance of an implementation as simple as this:
class Cache
def initialize
@memory = {}
end
def get(request)
@memory[request]
end
def set(request, response)
@memory[request] = response
end
endOr use an adapter for Dalli, Redis, or Rails cache that also support an optional time-to-live default_ttl parameter. If you use Rails.cache with the adapter :memory_store or :mem_cache_store, the object you would have to pass looks like this:
require "typhoeus/cache/rails"
cache: Typhoeus::Cache::Rails.new(Rails.cache, default_ttl: 600) # 600 secondsRead more about how to use it: https://github.com/typhoeus/typhoeus#caching
Example usage
To use the gem, it is recommended to write wrapper classes for the endpoints used. While it would be possible to use the get, get!, post, post!, put, put!, patch, patch!, delete, delete! or also the bare request.run methods directly, wrapper classes will unify the usage pattern and be very convenient to use by veterans and newcomers to the team. A wrapper class could look like this:
require 'chimera_http_client'
class Users
def initialize(base_url: 'http://localhost:3000/v1')
@base_url = base_url
end
# GET one user by id and instantiate a User
#
def find(id:)
response = connection.get!(['users', id])
user = response.parsed_body
User.new(id: id, name: user['name'], email: user['email'])
rescue ChimeraHttpClient::Error => error
# handle / log / raise error
end
# GET a list of users and instantiate an Array of Users
#
def all(filter: nil, page: nil)
params = {}
params[:filter] = filter
params[:page] = page
response = connection.get!('users', params: params, timeout: 10) # set longer timeout
all_users = response.parsed_body
all_users.map { |user| User.new(id: user['id'], name: user['name'], email: user['email']) }
rescue ChimeraHttpClient::Error => error
# handle / log / raise error
end
# CREATE a new user by sending attributes in a JSON body and instantiate the new User
#
def create(body:)
response = connection.post!('users', body: body.to_json) # body.to_json (!!)
user = response.parsed_body
User.new(id: user['id'], name: user['name'], email: user['email'])
rescue ChimeraHttpClient::Error => error
# handle / log / raise error
end
private
def connection
# base_url is mandatory
# logger and timeout are optional
@connection ||= ChimeraHttpClient::Connection.new(base_url: @base_url, logger: Logger.new(STDOUT), timeout: 2)
end
endTo create and fetch a user from a remote service with the Users wrapper listed above, calls could be made like this:
users = Users.new
new_user = users.create(body: { name: "Andy", email: "andy@example.com" })
id = new_user.id
user = users.find(id: id)
user.name # == "Andy"The Request class
Usually it does not have to be used directly. It is the class that executes the Typhoeus::Requests, raises Errors on failing and returns Response objects on successful calls.
The body which it receives from the Connection class has to be in the in the (serialized) form in which the endpoint expects it. Usually this means you have to pass a JSON string to the body (it will not be serialized automatically).
The Response class
The ChimeraHttpClient::Response objects have the following interface:
* body (content the call returns)
* code (http code, should be 200 or 2xx)
* time (for monitoring)
* response (the full response object, including the request)
* success? (returns the result of response.success?)
* error? (returns false)
* parsed_body (returns the result of `deserializer[:response].call(body)`)
If your API does not use JSON, but a different format e.g. XML, you can pass a custom deserializer to the Connection.
Error classes
All errors inherit from ChimeraHttpClient::Error and therefore offer the same attributes:
* code (http error code)
* body (alias => message)
* time (for monitoring)
* response (the full response object, including the request)
* success? (returns the result of response.success?)
* error? (returns true)
* error_class (e.g. ChimeraHttpClient::NotFoundError)
* to_s (information for logging / respects ENV['CHIMERA_HTTP_CLIENT_LOG_REQUESTS'])
* to_json (information to return to the API consumer / respects ENV['CHIMERA_HTTP_CLIENT_LOG_REQUESTS'])
The error classes and their corresponding http error codes:
ConnectionError # 0
RedirectionError # 301, 302, 303, 307
BadRequestError # 400
UnauthorizedError # 401
PaymentRequiredError # 402
ForbiddenError # 403
NotFoundError # 404
MethodNotAllowedError # 405
ResourceConflictError # 409
UnprocessableEntityError # 422
ClientError # 400..499
ServerError # 500..599
TimeoutError # timeout
The Queue class
Instead of making single requests immediately, the ChimeraHttpClient allows to queue requests and run them in parallel.
The number of parallel requests is limited by your system. There is a hard limit for 200 concurrent requests. You will have to measure yourself where the sweet spot for optimal performance is - and when things start to get flaky. I recommend to queue not much more than 20 requests before running them.
Queueing requests
The initializer of the Queue class expects and handles the same parameters as the Connection class.
queue = ChimeraHttpClient::Queue.new(base_url: 'http://localhost:3000/v1')queue.add expects and handles the same parameters as the requests methods of a connection.
queue.add(method, endpoint, options = {})The only difference is that a parameter to set the HTTP method has to prepended. Valid options for method are:
-
:get/'get'/'GET' -
:post/'post'/'POST' -
:put/'put'/'PUT' -
:patch/'patch'/'PATCH' -
:delete/'delete'/'DELETE'
Executing requests in parallel
Once the queue is filled, run all the requests concurrently with:
responses = queue.executeresponses will contain an Array of ChimeraHttpClient::Response objects when all calls succeed. If any calls fail, the Array will also contain ChimeraHttpClient::Error objects. It is in your responsibility to handle the errors.
Tip: every
Responseand everyErrormake the underlyingTypheous::Requestavailable overobject.response.request, which could help with debugging, or with building your own retry mechanism.
Empty the queue
The queue is emptied after execution. You could also empty it at any other point before by calling queue.empty.
To inspect the requests waiting for execution, call queue.queued_requests.
Installation
Add this line to your application's Gemfile:
gem 'chimera_http_client', '~> 1.1'
And then execute:
$ bundle
When updating the version, do not forget to run
$ bundle update chimera_http_client
Maintainers and Contributors
After checking out the repo, run bundle install and then bundle execute rake to run the tests and rubocop.
The test suite uses a Sinatra server to make real HTTP requests. It is mounted via Capybara_discoball and running in the same process. It is still running reasonably fast (on my MacBook Air):
Finished in 2.01 seconds (files took 1.09 seconds to load)
824 examples, 0 failures, 7 pending
You can also run rake console to open an irb session with the ChimeraHttpClient pre-loaded that will allow you to experiment.
To build and install this gem onto your local machine, run bundle exec rake install.
Maintainers only:
To release a new version, update the version number in
version.rb, commit this change, and then runbundle exec rake release, which will create a git tag for the version, push git commits and tags, and push the.gemfile to rubygems.org.
Bug reports and pull requests are welcome on GitHub at https://github.com/mediafinger/chimera_http_client
Roadmap
https://github.com/mediafinger/chimera_http_client/blob/master/TODO.markdown
Chimera
Why this name? First of all, I needed a unique namespace. HttpClient is already used too often. And as this gem is based on Typhoeus I picked the name of one of his (mythological) children.