The different pieces of a CHT project, how they interact, and what they’re used for
CHT Sync and CHT Pipeline
Most CHT deployments require some sort of analytics so that stakeholders can make data driven decisions. CouchDB, which is the database used by the CHT, is not designed for analytics. It is a document database, which means that it is optimized for storing and retrieving documents. It is not optimized for aggregating data. For example, if you wanted to know how many patients were registered in a particular area, you would have to query the database for all the patients in that area, and then count them. This is not a very efficient process. It is much more efficient to store the number of patients in a particular area in a separate database, and update that number whenever a patient is registered or unregistered. This is what CHT Sync paired with CHT Pipeline is designed to do.
Logstash and PostgREST
A free and open source SQL database used for analytics queries. See more at the PostgreSQL site.
DBT is used to ingest raw JSON data from the postgres database and normalize it into a relational schema to make it much easier to query.
Apache Superset is a free an open source platform for creating data dashboards.
CHT Core Framework & CouchDB
For more information on these technologies, see CHT Core overview.
CHT Sync is a logstash and PostgREST application that runs on the server. It is a service that listens to changes in the CHT database, and updates the analytics database accordingly. It is designed to be run as a service on the server, and it is not designed to be accessed by users. It is not a web application, and it does not have a user interface. It is designed to be run on the same server as the CHT, but it can be run on a separate server if necessary. CHT Sync runs in a Docker container. See the CHT Sync readme for more information and instructions on how to run it.
CHT Sync puts all new data into the postgres database into a single table that has a
jsonb column. This is not very useful for analytics. CHT Pipeline is a set of SQL queries that transform the data in the jsonb column into a more useful format. It uses DBT to define the data transformations. There is a daemon that runs CHT Pipeline, and it updates the database whenever the data in the jsonb column changes.
You can pass in your CHT Pipeline model definitions to CHT Sync through the variables passed to the docker container. This ensures that you have an easy way of getting your data into the analytics database seamlessly.
An open source monitoring system using Grafana and Prometheus
Was this page helpful?
Glad to hear it! Please tell us how we can improve.
Sorry to hear that. Please tell us how we can improve.