Datopian Tech 👩‍💻

We are experts in data management.
This is an overview of our technology.

# Data Management Systems

A Data Management System (DMS) is a system or framework for managing data. It is a core technology that can be used to build a variety of solutions, such as Data Portals, Data Catalogs and Data Lakes.

We have two related DMS stacks:

  • CKAN is the open source data management system we created in 2007 and that we continue to develop and maintain. The main information on CKAN is at https://ckan.org/. Here we have some specific notes on how we develop and deploy CKAN as well as our thoughts on the next generation of CKAN (v3).
  • DataHub is our version of a next generation of CKAN that powers DataHub.io. DataHub and CKAN v3 share many of the same core components. We are actively working on DataHub v2, and an outline can be found at https://github.com/datopian/datahub-next
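
As a rough illustration of what talking to a CKAN-based DMS looks like, the sketch below builds a request URL for CKAN's Action API `package_search` endpoint and reads dataset titles out of the JSON envelope it returns. The portal address `https://demo.example.org/` and the sample response are made up for the example, and the helper names `package_search_url` and `dataset_titles` are hypothetical; no network call is made.

```python
import json
from typing import List
from urllib.parse import urlencode


def package_search_url(base_url: str, query: str, rows: int = 5) -> str:
    """Build a URL for CKAN's package_search action (dataset search)."""
    params = urlencode({"q": query, "rows": rows})
    return f"{base_url.rstrip('/')}/api/3/action/package_search?{params}"


def dataset_titles(response_text: str) -> List[str]:
    """Extract dataset titles from a package_search JSON response."""
    payload = json.loads(response_text)
    if not payload.get("success"):
        raise ValueError("CKAN reported an unsuccessful request")
    # Fall back to the dataset's machine name when it has no title.
    return [d.get("title") or d["name"] for d in payload["result"]["results"]]


# Hypothetical portal address and a hand-written sample response.
url = package_search_url("https://demo.example.org/", "air quality", rows=2)
sample = json.dumps(
    {
        "success": True,
        "result": {
            "count": 1,
            "results": [{"name": "air-quality", "title": "Air Quality"}],
        },
    }
)

print(url)
print(dataset_titles(sample))
```

To query a real portal, the same URL could be fetched with any HTTP client and the response body passed to `dataset_titles`.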

# Solutions

You can use a DMS to build many kinds of specific solutions:

  • Data Portals are gateways to data. That gateway can be big or small, open or restricted. For example, data.gov is open to everyone, whilst an enterprise “intra” data portal is restricted to its personnel.
  • Data Catalog: see https://ckan.org/
  • Data Lake: you can use a DMS to rapidly create a data lake using existing infrastructure. For example, you can combine the DMS's catalog and storage gateway with existing cloud storage and data processing capabilities.
  • Data Engineering: you can use components of the DMS to rapidly create, orchestrate and supply data pipelines.
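
To make the Data Lake and Data Engineering ideas a little more concrete, here is a minimal sketch, not a description of how CKAN or DataHub implement them. It assumes a catalog that only records where each dataset lives in existing storage, and a pipeline that is an ordered list of plain functions. The names `Catalog`, `run_pipeline`, `drop_empty` and `to_upper`, and the `s3://example-bucket/...` location, are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List

Row = Dict[str, str]
Step = Callable[[List[Row]], List[Row]]


@dataclass
class Catalog:
    """Maps dataset names to locations in existing storage (hypothetical)."""

    locations: Dict[str, str] = field(default_factory=dict)

    def register(self, name: str, storage_url: str) -> None:
        self.locations[name] = storage_url

    def resolve(self, name: str) -> str:
        return self.locations[name]


def run_pipeline(rows: List[Row], steps: Iterable[Step]) -> List[Row]:
    """Apply each step in order, feeding its output to the next step."""
    for step in steps:
        rows = step(rows)
    return rows


def drop_empty(rows: List[Row]) -> List[Row]:
    """Remove rows whose 'value' field is missing or empty."""
    return [r for r in rows if r.get("value")]


def to_upper(rows: List[Row]) -> List[Row]:
    """Upper-case the 'city' field of every row."""
    return [{**r, "city": r["city"].upper()} for r in rows]


catalog = Catalog()
catalog.register("air-quality", "s3://example-bucket/air-quality.csv")

rows = [{"city": "leeds", "value": "12"}, {"city": "york", "value": ""}]
cleaned = run_pipeline(rows, [drop_empty, to_upper])

print(catalog.resolve("air-quality"))
print(cleaned)
```

The point of the design is that the data stays in the storage you already have; the catalog only records where it is, and pipeline steps stay small enough to test on their own.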

# Features

A DMS has a variety of features. This section provides an overview and links to specific feature pages that include details of how they work in CKAN and CKAN v3 / DataHub.

TIP

There are many ways to break down features, and this is just one framing. We are thinking about others, and if you have thoughts, please get in touch.

# Components

A DMS has the following key components:

https://coggle.it/diagram/Xiw2ZmYss-ddJVuK/t/data-portal-feature-breakdown

# Visualization and Dashboards

# Frictionless

The Frictionless Data toolkit. See https://frictionlessdata.io/

Our team created it whilst at Open Knowledge Foundation and continues to co-steward it.

# OpenSpending

https://openspending.org/

# Developer Experience

Site Reliability Engineering (SRE) and Developer Experience (DX) for our CKAN cluster technology.

Old cluster

# Glossary

Glossary »

# Notebook

Notebook – our informal blog