top of page
57_edited.jpg

​AskTilli!

The 360-degree Foundational Skills Dataset

Unlocking longitudinal, multi-informant insights into cognitive and social-emotional development for non-WEIRD populations.

The Problem: "Data Blindness" in AI

1. Sparse Outcomes

Most datasets focus only on static final scores, ignoring the "process" of how a child learns.

image.png

2. Isolated Data

Research is often siloed either looking at classroom performance or home environments, but never triangulating both.

image.png

3. Expensive Protocols

Standardized assessments require high-cost, in-person trained assessors, leaving resource-poor communities out of the AI evolution.

image.png

Current educational AI models are predominantly trained on data from

W.E.I.R.D.

  Western, Educated, Industrialized, Rich, and Democratic populations.  

This creates a

"data blind spot" for children in

South Asia, the Middle East, and other diverse contexts.

We provide a unified view of child development!

Across 12 core competencies, categorized into two critical domains

6 Cognitive Skills

6 Social-Emotional Learning (SEL) Skills

OTHER DOMAIN AND PARAMETERS

Academic Domain
  • Math Scores 

  • Literacy Scores 

Captures core learning outcomes through math and literacy. Math reflects problem-solving and numerical understanding, while literacy measures reading, comprehension, and expression—together indicating overall academic proficiency.

[For Ishani] AI images (9).png

How We Measure: 
The Triadic Assessment Protocol

We move beyond single-informant bias by capturing three perspectives for every child, using a digitized, voice-supported, and localized protocol:

4.png
2.png
[For Ishani] AI images (12).png
5.png

Note: All data is de-identified, privacy-first, and delivered via low-bandwidth messaging platforms to ensure inclusivity.

Demographics & Ecology

Equity markers, home environment, family structure, and special needs status.

Teacher /

Classroom Report

Pedagogical context, concentration/distractibility ratings, and interpersonal assessments.

Direct Assessments

Behavioral interaction streams, raw latency, task-switching performance, and foundational literacy/numeracy readiness.

Caregiver

Report

Home-based emotional symptoms, behavioral outcomes, and executive function ratings.

What We Measure:
Data Structure

Our dataset is composed of four interoperable CSV files linked by a unique subject_id:

Our high-density behavioral streams enable researchers to build the next generation of AI!

AI Use Cases:
From Research to Action

1 / Predictive Intervention

Identify Grade 3 cognitive signatures that serve as early warning signs for academic struggle.

2 / Context-Aware Profiling

Use AI to weigh conflicting reports from parents vs. teachers, turning "discrepancies" into signals for better child support.

3 / Synthetic Learner Profiles

Train Generative AI (LLMs/LRMs) to simulate diverse, realistic student profiles for teacher training and curriculum stress-testing.

4 / Algorithmic Fairness Audits

Use our localized dataset to audit and improve the regional robustness of global SEL models.

Resources and More!

Technical Documentation

This guide is intended for software developers who are familiar with how web applications are built.

License

Creative Commons Attribution 4.0 (CC BY 4.0)

GitHub

This guide is intended for developers using GitHub who are familiar with how web applications are built.

About the Team

We are a Research-Practice Partnership (RPP) involving UNICEF, Tilli Kids, and global educators committed to open-source learning engineering.

bottom of page