Categorical Databases
Home | 
Download | 
Getting Started | 
Manual |
Wiki |
Papers |
Screen Shots |
Github |
Google Group |
Conexus |
Contact 
 Read This First 
The suggested introductory course of study includes an introductory 
 
paper about CQL for computer scientists,
 introductory 
slides 
 and 
video about CQL from a functional programming perspective,
  introductory 
slides 
 and 
video about CQL from a mathematical perspective,
 introductory 
slides 
 about category theory from a knowledge management perspective,
 a 
paper
  from 
NIST arguing that category theory is critical for IT interoperability, and an
 introductory 
textbook
 on category theory, as well as a data science 
case study.
Papers
 
-  Ologs: A Categorical Framework for Knowledge Representation (2011) [Pre-print]
| Describes how to use categories as database/ontology schemas.
-  Functorial Data Migration (2013) [Pre-print]
| Describes how to migrate data between categorical databases.
-  Database Queries and Constraints via Lifting Problems (2013) [Pre-print]
| Describes how certain common database queries and constraints can be encoded as topological lifts.
-  Definition of CQL as a context-free grammar with equations  (2015)
| Describes a canonical syntax associated with CQL, as well as an axiomatic semantics.
-  Relational Foundations for Functorial Data Migration  (2015) [Pre-print] [slides]
| Describes how to implement a fragment of CQL using SELECT/FROM/WHERE/UNION and a fresh-ID generator, and vice versa. 
-  QINL: Query-Integrated Languages (2015)
| Describes the relationship between CQL and comprehension/monad syntax.
-  Algebraic Model Management: A Survey (2016) [Pre-print]
| Describes CQL entirely in terms of multi-sorted equational logic, and contrasts it with existing tools.   
-  Algebraic Databases (2017) [Pre-print] 
| Describes CQL in detail, and in particular, user-defined functions.  
-  Algebraic Data Integration (2017)  [Pre-print] [slides] [Aggregation supplement] [video]
| Describes CQL in detail, and in particular, how to implement it using automated theorem proving techniques and how to use it to integrate data.   
-  Informal Data Transformation Considered Harmful (2019) 
| Describes how CQL can preserve the data quality required to power machine learning algorithms through various data management tasks.  
-  Fast Left Kan Extensions Using the Chase (2022) 
| Describes CQL's sigma operation in detail, and in particular, how to implement it using a chase engine. 
-  Presenting Profunctors (2024)
| Motivated by problems in categorical database theory, we introduce and compare two notions of presentation for profunctors, uncurried and curried. 
Case Studies
 
Presentations
 
-  Boston Haskell (2011) [videos 1 2 3 4 5] 
 | Gives an introduction to the math behind CQL. 
-  Oracle (2014)
 | Gives an introduction to the math behind CQL, tailored to database-centric audiences.  
-  Boston Haskell (2014) [video] 
 | Gives an introduction to the math behind CQL, tailored to functional-programming audiences. 
-  Lambda Conf (2017) [video]
 | Gives an introduction to CQL for computer scientists. 
-  The Broad Institute  (2017) [video] 
 | Gives an introduction to the math behind CQL, and lessons learned from applying it in practice. 
-  Dataversity Architecture Summit (2017) 
 | Gives an introduction to CQL tailored to data architects. 
-  Kensho (2019) [video] 
 | Gives an introduction CQL and the math behind it, and describes how it enables universal semantic IT inter-operability. 
Related