cm-dashboard

Author	SHA1	Message	Date
Christoffer Martinsson	b2b301332f	Fix storage display showing missing total usage data All checks were successful Build and Release / build-and-release (push) Successful in 2m10s Details The structured data bridge conversion was only converting individual drive metrics (temperature, wear) and filesystem metrics, but wasn't generating the aggregated total usage metrics expected by the storage widget (disk_{drive}_total_gb, disk_{drive}_used_gb, disk_{drive}_usage_percent). This caused physical drives to display "—% —GB/—GB" instead of actual usage statistics. Updated the bridge conversion to calculate drive totals by aggregating all filesystems on each drive: - total_used = sum of all filesystem used_gb values - total_size = sum of all filesystem total_gb values - average_usage = (total_used / total_size) * 100 Now physical drives like nvme0n1 properly display total usage aggregated from all their filesystems (e.g., /boot + / = total drive usage). Version bump: v0.1.131 → v0.1.132	2025-11-23 21:43:34 +01:00
Christoffer Martinsson	adf3b0f51c	Implement complete structured data architecture All checks were successful Build and Release / build-and-release (push) Successful in 2m10s Details Replace fragile string-based metrics with type-safe JSON data structures. Agent converts all metrics to structured data, dashboard processes typed fields. Changes: - Add AgentData struct with CPU, memory, storage, services, backup fields - Replace string parsing with direct field access throughout system - Maintain UI compatibility via temporary metric bridge conversion - Fix NVMe temperature display and eliminate string parsing bugs - Update protocol to support structured data transmission over ZMQ - Comprehensive metric type coverage: CPU, memory, storage, services, backup Version bump to 0.1.131	2025-11-23 21:32:00 +01:00
Christoffer Martinsson	0faed9309e	Improve host disconnection detection and fix notification exclusions All checks were successful Build and Release / build-and-release (push) Successful in 1m34s Details - Add dedicated heartbeat transmission every 5 seconds independent of metric collection - Fix host offline detection by clearing metrics for disconnected hosts - Move exclude_email_metrics to NotificationConfig for better organization - Add cleanup_offline_hosts method to remove stale metrics after heartbeat timeout - Ensure offline hosts show proper status icons and visual indicators Version 0.1.63	2025-11-08 11:33:32 +01:00
Christoffer Martinsson	5f6e47ece5	Implement heartbeat-based host connectivity detection All checks were successful Build and Release / build-and-release (push) Successful in 2m8s Details - Add agent_heartbeat metric to agent transmission for reliable host detection - Update dashboard to track heartbeat timestamps per host instead of general metrics - Add configurable heartbeat_timeout_seconds to dashboard ZMQ config (default 10s) - Remove unused timeout_ms from agent config and revert to non-blocking command reception - Remove unused heartbeat_interval_ms from agent configuration - Host disconnect detection now uses dedicated heartbeat metrics for improved reliability - Bump version to 0.1.57	2025-11-06 11:04:01 +01:00
Christoffer Martinsson	aaf7edfbce	Implement cross-host agent version comparison - MetricStore tracks agent versions from all hosts - Detects version mismatches using most common version as reference - Dashboard logs warnings for hosts with outdated agents - Foundation for visual version mismatch indicators in UI - Helps identify deployment inconsistencies across infrastructure	2025-10-26 10:42:26 +01:00
Christoffer Martinsson	3d2b37b26c	Remove hardcoded defaults and migrate dashboard config to NixOS - Remove all unused configuration options from dashboard config module - Eliminate hardcoded defaults - dashboard now requires config file like agent - Keep only actually used config: zmq.subscriber_ports and hosts.predefined_hosts - Remove unused get_host_metrics function from metric store - Clean up missing module imports (hosts, utils) - Make dashboard fail fast if no configuration provided - Align dashboard config approach with agent configuration pattern	2025-10-21 21:54:23 +02:00
Christoffer Martinsson	00a8ed3da2	Implement hysteresis for metric status changes to prevent flapping Add comprehensive hysteresis support to prevent status oscillation near threshold boundaries while maintaining responsive alerting. Key Features: - HysteresisThresholds with configurable upper/lower limits - StatusTracker for per-metric status history - Default gaps: CPU load 10%, memory 5%, disk temp 5°C Updated Components: - CPU load collector (5-minute average with hysteresis) - Memory usage collector (percentage-based thresholds) - Disk temperature collector (SMART data monitoring) - All collectors updated to support StatusTracker interface Cache Interval Adjustments: - Service status: 60s → 10s (faster response) - Disk usage: 300s → 60s (more frequent checks) - Backup status: 900s → 60s (quicker updates) - SMART data: moved to 600s tier (10 minutes) Architecture: - Individual metric status calculation in collectors - Centralized StatusTracker in MetricCollectionManager - Status aggregation preserved in dashboard widgets	2025-10-20 18:45:41 +02:00
Christoffer Martinsson	0141a6e111	Remove unused code and eliminate build warnings Removed unused widget subscription system, cache utilities, error variants, theme functions, and struct fields. Replaced subscription-based widgets with direct metric filtering. Build now completes with zero warnings.	2025-10-18 23:50:15 +02:00
Christoffer Martinsson	8a36472a3d	Implement real-time process monitoring and fix UI hardcoded data This commit addresses several key issues identified during development: Major Changes: - Replace hardcoded top CPU/RAM process display with real system data - Add intelligent process monitoring to CpuCollector using ps command - Fix disk metrics permission issues in systemd collector - Optimize service collection to focus on status, memory, and disk only - Update dashboard widgets to display live process information Process Monitoring Implementation: - Added collect_top_cpu_process() and collect_top_ram_process() methods - Implemented ps-based monitoring with accurate CPU percentages - Added filtering to prevent self-monitoring artifacts (ps commands) - Enhanced error handling and validation for process data - Dashboard now shows realistic values like "claude (PID 2974) 11.0%" Service Collection Optimization: - Removed CPU monitoring from systemd collector for efficiency - Enhanced service directory permission error logging - Simplified services widget to show essential metrics only - Fixed service-to-directory mapping accuracy UI and Dashboard Improvements: - Reorganized dashboard layout with btop-inspired multi-panel design - Updated system panel to include real top CPU/RAM process display - Enhanced widget formatting and data presentation - Removed placeholder/hardcoded data throughout the interface Technical Details: - Updated agent/src/collectors/cpu.rs with process monitoring - Modified dashboard/src/ui/mod.rs for real-time process display - Enhanced systemd collector error handling and disk metrics - Updated CLAUDE.md documentation with implementation details	2025-10-16 23:55:05 +02:00

9 Commits