Topics, variables, and features
System is designed to maximize precision in how evidence is captured and represented, while also ensuring that information about the same or similar things is grouped together. This is key to building and representing one system.
Words necessarily obfuscate some complexity of the system they represent. So the System Platform organizes information into three layers of precision. To make it clear what resolution you’re investigating, System color codes this as follows:
Most specific | Information at the data level is represented in blue. | new_confirmed |
| Information at the variable level is represented in black. | Covid-19 Incidence |
Least specific | Information at the topic level is represented in red. | COVID-19 |
Topics make up the platform’s semantic layer. They provided shared meaning spaces to organize and explore information on System. At this highest level, System organizes variables by the topics they measure.
The System Platform pulls all topics and their definitions from Wikidata and regularly indexes Wikidata to remain up-to-date with the Semantic Web.
Unlike variables, topics can’t be measured in and of themselves.
Variables on System are scientific variables used to measure topics in the world. We define variables, simply, as anything that can be measured using a defined methodology.
Variables like Body Mass Index (BMI) and Gross Domestic Product (GDP) help us understand our health and the economy. BMI, for example, is universally defined as the body mass divided by the square of the body height, and is expressed in units of kg/m2.
A variables may be one of several ways of measuring a topic. For example:
Variable Name | Topic Name |
---|---|
| Obesity |
| Human Pregnancy |
Depending on the goal of a study, researchers measure the topic Air Pollution in different ways. For example:
- "Concentration of Fine Particulate Matter" when studying the role that Fine Particulate Matter plays in premature deaths per year worldwide, as per this study.
- "Toxicity of Fine Particulate Matter", when comparing levels of toxicity in various combustion sources and non-combustion sources, such as diesel engines, road dust, coal combustion, as per this study.
- "Concentration of Nitrogen Dioxide", when studying the effect of nitrogen dioxide concentrations on bronchial inflammations, as per this study.
Variables Monitoring
System also provides a way of tracking the values of variables as a system. When variables are connected to data with time-varying values, System displays the current value alongside the most recent previous value and the percent change between the values. When the values and recent changes are examined for systems of related variables, you get richer context to consider the data. While depictions of changes in value can be found in many places (economic indicator reports, health records, etc.), this feature is designed to show you systems of changes in the data that are used to measure our world. Since System shows you the statistical direction of the relationships between variables (though not necessarily the causal direction), this feature can help you predict changes in variables before they are actually measured and consider possible actions to take to move variables.

Features (data) are the most precise building block in System’s information architecture. Whenever possible, pieces of evidence are gathered and computed at this level to maximize accuracy and reproducibility.
System matches data, variables, and topics as follows: A topic can be matched to any variable and a variable can be matched to any feature. This many-to-one organization allows System to aggregate statistical associations even when the underlying features and variables are operationalized slightly differently (e.g. measured on different populations).