FLOW: A Feedback-Driven Synthetic Longitudinal Dataset of Work and Wellbeing
By: Wafaa El Husseini
Access to longitudinal, individual-level data on work-life balance and wellbeing is limited by privacy, ethical, and logistical constraints. This poses challenges for reproducible research, methodological benchmarking, and education in domains such as stress modeling, behavioral analysis, and machine learning. We introduce FLOW, a synthetic longitudinal dataset designed to model daily interactions between workload, lifestyle behaviors, and wellbeing. FLOW is generated using a rule-based, feedback-driven simulation that produces coherent temporal dynamics across variables such as stress, sleep, mood, physical activity, and body weight. The dataset simulates 1{,}000 individuals over a two-year period with daily resolution and is released as a publicly available resource. In addition to the static dataset, we describe a configurable data generation tool that enables reproducible experimentation under adjustable behavioral and contextual assumptions. FLOW is intended as a controlled experimental environment rather than a proxy for observed human populations, supporting exploratory analysis, methodological development, and benchmarking where real-world data are inaccessible.
Similar Papers
PixleepFlow: A Pixel-Based Lifelog Framework for Predicting Sleep Quality and Stress Level
Signal Processing
Helps understand sleep and stress from pictures.
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI
Machine Learning (CS)
Makes AI learn better from data.
Flow Gym
Fluid Dynamics
Teaches computers to see moving air from pictures.