cm-dashboard

Author	SHA1	Message	Date
Christoffer Martinsson	d164c1da5f	Add missing service_status field to ServiceData All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details	2025-11-24 21:20:09 +01:00
Christoffer Martinsson	66ab7a492d	Complete monitoring system restoration All checks were successful Build and Release / build-and-release (push) Successful in 2m39s Details Fully restored CM Dashboard as a complete monitoring system with working status evaluation and email notifications. COMPLETED PHASES: ✅ Phase 1: Fixed storage display issues - Use lsblk instead of findmnt (eliminates /nix/store bind mount) - Fixed NVMe SMART parsing (Temperature: and Percentage Used:) - Added sudo to smartctl for permissions - Consistent filesystem and tmpfs sorting ✅ Phase 2a: Fixed missing NixOS build information - Added build_version field to AgentData - NixOS collector now populates build info - Dashboard shows actual build instead of "unknown" ✅ Phase 2b: Restored status evaluation system - Added status fields to all structured data types - CPU: load and temperature status evaluation - Memory: usage status evaluation - Storage: temperature, health, and filesystem usage status - All collectors now use their threshold configurations ✅ Phase 3: Restored notification system - Status change detection between collection cycles - Email alerts on status degradation (OK→Warning/Critical) - Detailed notification content with metric values - Full NotificationManager integration CORE FUNCTIONALITY RESTORED: - Real-time monitoring with proper status evaluation - Email notifications on threshold violations - Correct storage display (nvme0n1 T: 28°C W: 1%) - Complete status-aware infrastructure monitoring - Dashboard is now a monitoring system, not just data viewer The CM Dashboard monitoring system is fully operational.	2025-11-24 19:58:26 +01:00
Christoffer Martinsson	adf3b0f51c	Implement complete structured data architecture All checks were successful Build and Release / build-and-release (push) Successful in 2m10s Details Replace fragile string-based metrics with type-safe JSON data structures. Agent converts all metrics to structured data, dashboard processes typed fields. Changes: - Add AgentData struct with CPU, memory, storage, services, backup fields - Replace string parsing with direct field access throughout system - Maintain UI compatibility via temporary metric bridge conversion - Fix NVMe temperature display and eliminate string parsing bugs - Update protocol to support structured data transmission over ZMQ - Comprehensive metric type coverage: CPU, memory, storage, services, backup Version bump to 0.1.131	2025-11-23 21:32:00 +01:00
Christoffer Martinsson	156d707377	Add version display and fix status aggregation priorities All checks were successful Build and Release / build-and-release (push) Successful in 2m37s Details - Add dynamic version display in top bar using CARGO_PKG_VERSION - Rewrite status aggregation to only show Critical/Warning/OK in top bar - Fix Status enum ordering to prioritize OK over transitional states - Remove blue/gray colors from top bar background	2025-11-21 16:19:45 +01:00
Christoffer Martinsson	9575077045	Fix Status::Inactive aggregation priority for green title bar All checks were successful Build and Release / build-and-release (push) Successful in 2m9s Details - Move Status::Inactive to lowest priority in enum (before Ok) - Status aggregation now prefers Ok over Inactive in mixed scenarios - Title bar stays green when mixing active and inactive services - Inactive services still show gray icons but don't affect overall status - Ensures healthy systems with stopped services maintain green status	2025-11-18 18:17:25 +01:00
Christoffer Martinsson	34a1f7b9dc	Fix Status::Inactive ordering to prevent gray title bar All checks were successful Build and Release / build-and-release (push) Successful in 2m8s Details - Reorder Status enum variants to fix aggregation priority - Status::Inactive now has same priority as Status::Ok in aggregation - Prevents inactive services from causing gray title bar - Title bar stays green when system has only active and inactive services - Only Unknown/Offline/Pending/Warning/Critical statuses affect title color	2025-11-18 18:03:50 +01:00
Christoffer Martinsson	d11aa11f99	Add Status::Inactive for inactive services with empty circle display All checks were successful Build and Release / build-and-release (push) Successful in 1m12s Details - Add new Status::Inactive variant to enum for better service state representation - Agent now assigns Status::Inactive instead of Status::Warning for inactive services - Dashboard displays inactive services with empty circle (○) icon in gray color - User-stopped services still show as Status::Ok with green filled circle - Inactive services treated as OK for host status aggregation - Improves visual clarity between active (●), inactive (○), and warning (◐) states	2025-11-18 17:54:51 +01:00
Christoffer Martinsson	6179bd51a7	Implement WakeOnLAN functionality with simplified configuration All checks were successful Build and Release / build-and-release (push) Successful in 2m32s Details - Add Status::Offline enum variant for disconnected hosts - All configured hosts now always visible showing offline status when disconnected - Add WakeOnLAN support using wake-on-lan Rust crate - Implement w key binding to wake offline hosts with MAC addresses - Simplify configuration to single [hosts] section with MAC addresses only - Change critical status icon from ◯ to ! for better visibility - Add proper MAC address parsing and error handling - Silent WakeOnLAN operation with logging for success/failure Configuration format: [hosts] hostname = { mac_address = "AA:BB:CC:DD:EE:FF" }	2025-10-31 09:03:01 +01:00
Christoffer Martinsson	b6da71b7e7	Implement real-time terminal popup for system rebuild operations All checks were successful Build and Release / build-and-release (push) Successful in 1m21s Details - Add terminal popup UI component with 80% screen coverage and terminal styling - Extend ZMQ protocol with CommandOutputMessage for streaming output - Implement real-time output streaming in agent system rebuild handler - Add keyboard controls (ESC/Q to close, ↑↓ to scroll) for popup interaction - Fix system panel Build display to show actual NixOS build instead of config hash - Update service filters in README with wildcard patterns for better matching - Add periodic progress updates during nixos-rebuild execution - Integrate command output handling in dashboard main loop	2025-10-26 11:39:03 +01:00
Christoffer Martinsson	a08670071c	Implement simple persistent cache with automatic saving on status changes	2025-10-21 20:12:19 +02:00
Christoffer Martinsson	d80f2ce811	Remove unused cache tiers system	2025-10-21 18:43:46 +02:00
Christoffer Martinsson	41208aa2a0	Implement status aggregation with notification batching	2025-10-21 18:12:42 +02:00
Christoffer Martinsson	00a8ed3da2	Implement hysteresis for metric status changes to prevent flapping Add comprehensive hysteresis support to prevent status oscillation near threshold boundaries while maintaining responsive alerting. Key Features: - HysteresisThresholds with configurable upper/lower limits - StatusTracker for per-metric status history - Default gaps: CPU load 10%, memory 5%, disk temp 5°C Updated Components: - CPU load collector (5-minute average with hysteresis) - Memory usage collector (percentage-based thresholds) - Disk temperature collector (SMART data monitoring) - All collectors updated to support StatusTracker interface Cache Interval Adjustments: - Service status: 60s → 10s (faster response) - Disk usage: 300s → 60s (more frequent checks) - Backup status: 900s → 60s (quicker updates) - SMART data: moved to 600s tier (10 minutes) Architecture: - Individual metric status calculation in collectors - Centralized StatusTracker in MetricCollectionManager - Status aggregation preserved in dashboard widgets	2025-10-20 18:45:41 +02:00
Christoffer Martinsson	7f85a6436e	Clean up unused imports and fix build warnings - Remove unused imports (Duration, HashMap, SharedError, DateTime, etc.) - Fix unused variables by prefixing with underscore - Remove redundant dashboard.toml config file - Update theme imports to use only needed components - Maintain all functionality while reducing warnings - Add srv02 to predefined hosts configuration - Remove unused broadcast_command methods	2025-10-18 23:12:07 +02:00
Christoffer Martinsson	dcca5bbea3	Fix cache tier test to match actual configuration - Update test expectations from 5s to 2s intervals for realtime tier - Fix comment to reflect actual 2s interval instead of outdated 5s reference - All tests now pass correctly	2025-10-18 18:44:13 +02:00
Christoffer Martinsson	8a36472a3d	Implement real-time process monitoring and fix UI hardcoded data This commit addresses several key issues identified during development: Major Changes: - Replace hardcoded top CPU/RAM process display with real system data - Add intelligent process monitoring to CpuCollector using ps command - Fix disk metrics permission issues in systemd collector - Optimize service collection to focus on status, memory, and disk only - Update dashboard widgets to display live process information Process Monitoring Implementation: - Added collect_top_cpu_process() and collect_top_ram_process() methods - Implemented ps-based monitoring with accurate CPU percentages - Added filtering to prevent self-monitoring artifacts (ps commands) - Enhanced error handling and validation for process data - Dashboard now shows realistic values like "claude (PID 2974) 11.0%" Service Collection Optimization: - Removed CPU monitoring from systemd collector for efficiency - Enhanced service directory permission error logging - Simplified services widget to show essential metrics only - Fixed service-to-directory mapping accuracy UI and Dashboard Improvements: - Reorganized dashboard layout with btop-inspired multi-panel design - Updated system panel to include real top CPU/RAM process display - Enhanced widget formatting and data presentation - Removed placeholder/hardcoded data throughout the interface Technical Details: - Updated agent/src/collectors/cpu.rs with process monitoring - Modified dashboard/src/ui/mod.rs for real-time process display - Enhanced systemd collector error handling and disk metrics - Updated CLAUDE.md documentation with implementation details	2025-10-16 23:55:05 +02:00
Christoffer Martinsson	1b572c5c1d	Implement intelligent caching system for optimal CPU performance Replace traditional 5-second polling with tiered collection strategy: - RealTime (5s): CPU load, memory usage - Medium (5min): Service status, disk usage - Slow (15min): SMART data, backup status Key improvements: - Reduce CPU usage from 9.5% to <2% - Cache warming for instant dashboard responsiveness - Background refresh at 80% of tier intervals - Thread-safe cache with automatic cleanup Remove legacy polling code - smart caching is now the default and only mode. Agent startup enhanced with parallel cache population for immediate data availability. Architecture: SmartCache + CachedCollector + tiered CollectionScheduler	2025-10-15 11:21:36 +02:00
Christoffer Martinsson	57b676ad25	Testing	2025-10-13 00:16:24 +02:00
Christoffer Martinsson	2581435b10	Implement per-service disk usage monitoring Replaced system-wide disk usage with accurate per-service tracking by scanning service-specific directories. Services like sshd now correctly show minimal disk usage instead of misleading system totals. - Rename storage widget and add drive capacity/usage columns - Move host display to main dashboard title for cleaner layout - Replace separate alert displays with color-coded row highlighting - Add per-service disk usage collection using du command - Update services widget formatting to handle small disk values - Restructure into workspace with dedicated agent and dashboard packages	2025-10-11 22:59:16 +02:00
Christoffer Martinsson	82afe3d4f1	Restructure into workspace with dashboard and agent	2025-10-11 14:19:05 +02:00

20 Commits