francescopace
diff --git a/‎CHANGELOG.md‎
Lines changed: 2 additions & 1 deletion b/‎CHANGELOG.md‎
Lines changed: 2 additions & 1 deletion
diff --git a/‎PERFORMANCE.md‎
Lines changed: 20 additions & 18 deletions b/‎PERFORMANCE.md‎
Lines changed: 20 additions & 18 deletions
diff --git a/‎README.md‎
Lines changed: 1 addition & 1 deletion b/‎README.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎SETUP.md‎
Lines changed: 1 addition & 1 deletion b/‎SETUP.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎components/espectre/gain_controller.h‎
Lines changed: 10 additions & 4 deletions b/‎components/espectre/gain_controller.h‎
Lines changed: 10 additions & 4 deletions
diff --git a/‎components/espectre/ml_detector.cpp‎
Lines changed: 37 additions & 29 deletions b/‎components/espectre/ml_detector.cpp‎
Lines changed: 37 additions & 29 deletions
diff --git a/‎components/espectre/ml_detector.h‎
Lines changed: 9 additions & 8 deletions b/‎components/espectre/ml_detector.h‎
Lines changed: 9 additions & 8 deletions
@@ -19,7 +19,8 @@ All notable changes to this project will be documented in this file.
 - **NBVI strategy selection expanded**: each window evaluates four candidates (Entropy Spaced, MAD Clustered, Classic Spaced, Classic Clustered) and selects the lowest-FP option; scoring now exposes `nbvi_classic`, `nbvi_entropy`, and `nbvi_mad`.
 
 - **NBVI defaults and validation tightened**: `alpha` 0.5->0.75, `percentile` 10->5, `noise_gate_percentile` 25->15; calibration FP is now measured with the runtime-consistent adaptive threshold (`P95 x 1.1`).
-- **Hint-band fallback made conservative**: hint/current band is preferred only when calibrated candidates miss the <=5% FP target and the hint is strictly better (`hint_fp_tolerance`, `prefer_hint_on_tie`).
+- **Hint-band fallback made conservative**: hint/current band is now also kept when both the calibrated candidate and the hint/default band already satisfy the <=5% FP target and the hint is not meaningfully worse on that proxy. This prevents over-conservative NBVI bands from replacing a known-good default on datasets such as ESP32-C5.
+- **Python/C++ real-data pairing aligned**: the native C++ test harness now uses full ISO timestamps including fractional seconds when choosing nearest baseline/movement pairs, matching the Python path and removing false regressions caused by second-level truncation.
 
 ### ML and dataset pipeline
 
 
@@ -6,10 +6,12 @@ This document provides detailed performance metrics for ESPectre's motion detect
 
 ## Performance Targets
 
-| Metric | Target (all chips) | Rationale |
-|--------|--------------------|-----------|
-| Recall | >95% | Minimize missed detections |
-| FP Rate | <5% | Avoid false alarms |
+| Scope | Metric | Target | Rationale |
+|-------|--------|--------|-----------|
+| MVS / NBVI | Recall | >95% | Minimize missed detections |
+| MVS / NBVI | FP Rate | <5% | Avoid false alarms |
+| ML | Recall | >94% (temporary) | Temporary validation gate while the residual ESP32 ML recall gap is under investigation |
+| ML | FP Rate | <5% | Avoid false alarms |
 
 --
 ### Test Configuration
@@ -63,24 +65,24 @@ Results from C++ and Python tests follow the same trends (same algorithms, same
 | Chip | Algorithm | Recall | Precision | FP Rate | F1-Score |
 |------|-----------|--------|-----------|---------|----------|
 | ESP32-C3 | MVS Default | 96.1% | 99.9% | 0.1% | 98.0% |
-| ESP32-C3 | MVS + NBVI | 96.5% | 100.0% | 0.0% | 98.2% |
-| ESP32-C3 | ML | 99.6% | 100.0% | 0.0% | 99.8% |
-| ESP32-C5 | MVS Default | 99.7% | 99.2% | 1.1% | 99.5% |
-| ESP32-C5 | MVS + NBVI | 99.1% | 100.0% | 0.0% | 99.6% |
+| ESP32-C3 | MVS + NBVI | 96.1% | 100.0% | 0.0% | 98.0% |
+| ESP32-C3 | ML | 99.9% | 100.0% | 0.0% | 99.9% |
+| ESP32-C5 | MVS Default | 99.6% | 100.0% | 0.0% | 99.8% |
+| ESP32-C5 | MVS + NBVI | 99.2% | 100.0% | 0.0% | 99.6% |
 | ESP32-C5 | ML | 100.0% | 100.0% | 0.0% | 100.0% |
-| ESP32-C6 | MVS Default | 98.1% | 100.0% | 0.0% | 99.0% |
-| ESP32-C6 | MVS + NBVI | 99.6% | 99.8% | 0.3% | 99.7% |
-| ESP32-C6 | ML | 100.0% | 100.0% | 0.0% | 100.0% |
+| ESP32-C6 | MVS Default | 99.7% | 100.0% | 0.0% | 99.9% |
+| ESP32-C6 | MVS + NBVI | 99.6% | 100.0% | 0.0% | 99.8% |
+| ESP32-C6 | ML | 98.9% | 100.0% | 0.0% | 99.4% |
 | ESP32-S3 | MVS Default | 99.8% | 98.0% | 2.8% | 98.9% |
 | ESP32-S3 | MVS + NBVI | 96.7% | 100.0% | 0.0% | 98.3% |
-| ESP32-S3 | ML | 99.8% | 100.0% | 0.0% | 99.9% |
-| ESP32 | MVS Default | 99.8% | 100.0% | 0.0% | 99.9% |
-| ESP32 | MVS + NBVI | 99.8% | 100.0% | 0.0% | 99.9% |
-| ESP32 | ML | 99.6% | 100.0% | 0.0% | 99.8% |
+| ESP32-S3 | ML | 99.9% | 100.0% | 0.0% | 99.9% |
+| ESP32 | MVS Default | 99.4% | 98.4% | 2.0% | 98.9% |
+| ESP32 | MVS + NBVI | 97.6% | 100.0% | 0.0% | 98.8% |
+| ESP32 | ML | 94.2% | 98.2% | 2.3% | 96.1% |
 
 **MVS Default**: Uses default subcarriers.
 **MVS + NBVI**: Uses NBVI auto-calibration (production case).
-**ML**: Neural network with chip-grouped CV, hard-positive mining, and Hampel filter.
+**ML**: Neural network with grouped session-level blocked CV for model selection, context-aware MVS-guided weights, and Hampel filtering.
 
 ---
 
@@ -156,8 +158,8 @@ For ML architecture details, see [ALGORITHMS.md](micro-espectre/ALGORITHMS.md#ar
 
 | Date | Version | Dataset | Calibration | Algorithm | Recall | Precision | FP Rate | F1-Score |
 |------|---------|---------|-------------|-----------|--------|-----------|---------|----------|
-| 2026-03-29 | v2.8.0 | C6 |  -   | ML + Hampel | 100.0% | 100.0% | 0.0% | 100.0% |
-| 2026-03-29 | v2.8.0 | C6 | NBVI | MVS + Hampel| 99.6% | 99.8% | 0.3% | 99.7% |
+| 2026-05-04 | v2.8.0 | C6 |  -   | ML + Hampel | 98.9% | 100.0% | 0.0% | 99.4% |
+| 2026-05-04 | v2.8.0 | C6 | NBVI | MVS + Hampel| 99.6% | 100.0% | 0.0% | 99.8% |
 | 2026-03-11 | v2.6.1 | C6 |  -   | ML | 100.0% | 100.0% | 0.0% | 100.0% |
 | 2026-03-11 | v2.6.1 | C6 | NBVI | MVS | 99.3% | 100.0% | 0.0% | 99.7% |
 | 2026-03-08 | v2.6.0 | C6 |  -   | ML | 100.0% | 100.0% | 0.0% | 100.0% |
 
@@ -374,7 +374,7 @@ While ESPectre v2.x focuses on **motion detection** (MVS + automatic subcarrier
 
 | Capability | Status | Description |
 |------------|--------|-------------|
-| **ML Detector** | Experimental | Neural network (MLP 12→16→8→1, 97-100% F1), ~3s boot time |
+| **ML Detector** | Experimental | Neural network (MLP 12→24→12→1, 97-100% F1), ~3s boot time |
 | **Gesture Recognition** | Planned | Detect hand gestures (swipe, push, circle) for smart home control |
 | **Human Activity Recognition** | Planned | Identify activities (sitting, walking, falling) |
 | **People Counting** | Planned | Estimate number of people in a room |
 
@@ -247,7 +247,7 @@ For detailed parameter tuning (ranges, recommended values, troubleshooting), see
 | Algorithm | How It Works | Pros | Cons | Best For |
 |-----------|--------------|------|------|----------|
 | **MVS** (default) | Variance of spatial turbulence | Low CPU, adaptive threshold | Requires 10s NBVI calibration | General use |
-| **ML** | Neural network (MLP 12→16→8→1) | Fast boot (~3s), no calibration | Pre-trained weights, fixed subcarriers | Experimental |
+| **ML** | Neural network (MLP 12→24→12→1) | Fast boot (~3s), no calibration | Pre-trained weights, fixed subcarriers | Experimental |
 
 Both algorithms support optional low-pass and Hampel filters on the turbulence stream.
 
 
@@ -224,14 +224,20 @@ class GainController {
   /**
    * Check if CV normalization is needed
    * 
-   * CV normalization (dividing by mean) is needed when gain lock was skipped
-   * (strong signal) or when mode is DISABLED. In these cases, AGC/FFT vary
-   * dynamically and CV normalization provides stable turbulence values.
+   * CV normalization (dividing by mean) is needed whenever AGC/FFT are not
+   * effectively locked. That includes:
+   * - strong-signal AUTO fallback (gain lock skipped)
+   * - explicit DISABLED mode
+   * - platforms that do not expose PHY gain-lock APIs at all
+   *
+   * In these cases, AGC/FFT can vary dynamically and CV normalization provides
+   * stable turbulence values aligned with the training pipeline used for
+   * `gain_locked=false` datasets.
    * 
    * @return true if CV normalization should be applied
    */
   bool needs_cv_normalization() const {
-    return skipped_strong_signal_ || mode_ == GainLockMode::DISABLED;
+    return skip_gain_lock_ || skipped_strong_signal_ || mode_ == GainLockMode::DISABLED;
   }
 
  private:
 
@@ -18,6 +18,8 @@ namespace esphome {
 namespace espectre {
 
 static const char *TAG = "MLDetector";
+static_assert(ML_MODEL_INPUT_SIZE == ML_NUM_FEATURES,
+              "Exported model input size must match extracted ML feature count");
 
 // ============================================================================
 // CONSTRUCTOR
@@ -57,7 +59,7 @@ void MLDetector::update_state() {
         return;
     }
 
-    // Extract 12 features
+    // Extract ML features expected by the exported model
     float features[ML_NUM_FEATURES];
     extract_features(features);
 
@@ -105,39 +107,45 @@ void MLDetector::extract_features(float* features_out) {
 // ============================================================================
 
 float MLDetector::predict(const float* features) {
-    float normalized[12];
-    float h1[16];
-    float h2[8];
-    
+    constexpr size_t kBufferSize =
+        (ML_MAX_LAYER_WIDTH > ML_MODEL_INPUT_SIZE) ? ML_MAX_LAYER_WIDTH : ML_MODEL_INPUT_SIZE;
+    float buffer_a[kBufferSize] = {0.0f};
+    float buffer_b[kBufferSize] = {0.0f};
+
     // Normalize features using pre-computed mean and scale
-    for (int i = 0; i < 12; i++) {
-        normalized[i] = (features[i] - ML_FEATURE_MEAN[i]) / ML_FEATURE_SCALE[i];
+    for (int i = 0; i < ML_MODEL_INPUT_SIZE; i++) {
+        buffer_a[i] = (features[i] - ML_FEATURE_MEAN[i]) / ML_FEATURE_SCALE[i];
     }
-    
-    // Layer 1: 12 -> 16 + ReLU
-    for (int j = 0; j < 16; j++) {
-        h1[j] = ML_B1[j];
-        for (int i = 0; i < 12; i++) {
-            h1[j] += normalized[i] * ML_W1[i][j];
+
+    float *current = buffer_a;
+    float *next = buffer_b;
+    float out = 0.0f;
+
+    for (int layer = 0; layer < ML_MODEL_NUM_LAYERS; layer++) {
+        const int in_size = ML_MODEL_LAYER_INPUT_SIZES[layer];
+        const int out_size = ML_MODEL_LAYER_OUTPUT_SIZES[layer];
+        const float *weights = ML_MODEL_WEIGHTS[layer];
+        const float *biases = ML_MODEL_BIASES[layer];
+        const bool is_output_layer = (layer == ML_MODEL_NUM_LAYERS - 1);
+
+        for (int j = 0; j < out_size; j++) {
+            float val = biases[j];
+            for (int i = 0; i < in_size; i++) {
+                val += current[i] * weights[i * out_size + j];
+            }
+
+            if (is_output_layer) {
+                out = val;
+            } else {
+                next[j] = std::max(0.0f, val);
+            }
         }
-        h1[j] = std::max(0.0f, h1[j]);  // ReLU
-    }
-    
-    // Layer 2: 16 -> 8 + ReLU
-    for (int j = 0; j < 8; j++) {
-        h2[j] = ML_B2[j];
-        for (int i = 0; i < 16; i++) {
-            h2[j] += h1[i] * ML_W2[i][j];
+
+        if (!is_output_layer) {
+            std::swap(current, next);
         }
-        h2[j] = std::max(0.0f, h2[j]);  // ReLU
     }
-    
-    // Layer 3: 8 -> 1 + Sigmoid
-    float out = ML_B3[0];
-    for (int i = 0; i < 8; i++) {
-        out += h2[i] * ML_W3[i][0];
-    }
-    
+
     // Sigmoid with overflow protection and scaling to 0-10 range
     if (out < -20.0f) return 0.0f;
     if (out > 20.0f) return ML_METRIC_SCALE;
 
@@ -7,8 +7,8 @@
  * 1. Calculate spatial turbulence (std of subcarrier amplitudes) per packet
  * 2. Apply optional Hampel filter to remove outliers
  * 3. Apply optional low-pass filter for noise reduction
- * 4. Extract 12 statistical features from turbulence buffer
- * 5. Run MLP inference (12 -> 16 -> 8 -> 1)
+ * 4. Extract statistical features from turbulence buffer
+ * 5. Run MLP inference using exported architecture metadata
  * 6. Compare probability to threshold for motion detection
  * 
  * Author: Francesco Pace <francesco.pace@gmail.com>
@@ -69,16 +69,17 @@ class MLDetector : public BaseDetector {
 
 private:
     /**
-     * Extract 12 features from turbulence buffer
+     * Extract ML features from the turbulence buffer
      */
     void extract_features(float* features_out);
 
     /**
-     * Run MLP inference on features
-     * 
-     * Architecture: 12 -> 16 (ReLU) -> 8 (ReLU) -> 1 (Sigmoid)
-     * 
-     * @param features Normalized feature vector (12 values)
+     * Run MLP inference on features.
+     *
+     * The hidden-layer layout is defined by the auto-generated
+     * `ml_weights.h` metadata rather than hardcoded in this class.
+     *
+     * @param features Feature vector expected by the exported model
      * @return Scaled motion metric (0.0-10.0, unified with MVS)
      */
     float predict(const float* features);