cm-dashboard

Author	SHA1	Message	Date
Christoffer Martinsson	47ab1e387d	Add Status::Info for informational sub-services Agent uses Status enum to control display: - Status::Info: no icon, no status text (VPN IP) - Other statuses: icon + text (containers, nginx sites) Dashboard checks status, no hardcoded service_type exceptions. Version: v0.1.237	2025-12-01 15:11:16 +01:00
Christoffer Martinsson	40f3ff66d8	Show archive count range to detect inconsistencies - Display single number if all services have same count - Display min-max range if counts differ (indicates problem)	2025-11-29 17:59:24 +01:00
Christoffer Martinsson	f5913dbd43	Add archive count to backup disk display	2025-11-29 17:41:11 +01:00
Christoffer Martinsson	afb8d68e03	Implement multi-disk backup support - Update BackupData structure to support multiple backup disks - Scan /var/lib/backup/status/ directory for all status files - Calculate status icons for backup and disk usage - Aggregate repository status from all disks - Update dashboard to display all backup disks with per-disk status - Display repository list with count and aggregated status	2025-11-29 16:44:50 +01:00
Christoffer Martinsson	7c030b33d6	Show top 3 C-states with usage percentages - Changed CpuData.cstate from String to Vec<CStateInfo> - Added CStateInfo struct with name and percent fields - Collector calculates percentage for each C-state based on accumulated time - Sorts and returns top 3 C-states by usage - Dashboard displays: "C10:79% C8:10% C6:8%" Provides better visibility into CPU idle state distribution. Bump version to v0.1.209 Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-28 23:45:46 +01:00
Christoffer Martinsson	c6817537a8	Replace CPU frequency with C-state monitoring - Changed CpuData.frequency_mhz to CpuData.cstate (String) - Implemented collect_cstate() to read CPU idle depth from sysfs - Finds deepest C-state with most accumulated time (C0-C10) - Updated dashboard to display C-state instead of frequency - More accurate indicator of CPU activity vs power management Bump version to v0.1.208 Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-28 23:30:14 +01:00
Christoffer Martinsson	0e01813ff5	Add service metrics from systemctl (memory, uptime, restarts) Shared: - Add memory_bytes, restart_count, uptime_seconds to ServiceData Agent: - Add new fields to ServiceStatusInfo struct - Fetch MemoryCurrent, NRestarts, ExecMainStartTimestamp from systemctl show - Calculate uptime from start timestamp - Parse and populate new fields in ServiceData - Remove unused load_state and sub_state fields Dashboard: - Add memory_bytes, restart_count, uptime_seconds to ServiceInfo - Update header: Service, Status, RAM, Uptime, ↻ (restarts) - Format memory as MB/GB - Format uptime as Xd Xh, Xh Xm, or Xm - Show restart count with ! prefix if > 0 to indicate instability All metrics obtained from single systemctl show call - zero overhead.	2025-11-28 23:06:13 +01:00
Christoffer Martinsson	67b686f8c7	Remove RAM and disk collection for services Complete removal of service resource metrics: Agent: - Remove memory_mb and disk_gb fields from ServiceData struct - Remove get_service_memory_usage() method - Remove get_service_disk_usage() method - Remove get_directory_size() method - Remove unused warn import Dashboard: - Remove memory_mb and disk_gb from ServiceInfo struct - Remove memory/disk display from format_parent_service_line - Remove memory/disk parsing in legacy metric path - Remove unused format_disk_size() function Service resource metrics were slow, unreliable, and never worked properly since structured data migration. Will be handled differently in the future.	2025-11-28 14:25:12 +01:00
Christoffer Martinsson	2f94a4b853	Add service_type field to separate data from presentation Changes: - Add service_type field to SubServiceData: 'nginx_site', 'container', 'image' - Agent sends pure data without display formatting - Dashboard checks service_type to decide presentation - Docker images now display without status icon (service_type='image') - Remove unused image_size_str from docker images tuple Clean separation: agent provides data, dashboard handles display logic.	2025-11-27 18:09:20 +01:00
Christoffer Martinsson	937f4ad427	Add VLAN ID display and smart parent assignment for virtual interfaces Agent changes: - Parse /proc/net/vlan/config to extract VLAN IDs for interfaces - Detect primary physical interface via default route - Auto-assign primary interface as parent for virtual interfaces without explicit parent - Added vlan_id field to NetworkInterfaceData Dashboard changes: - Display VLAN ID in format "interface (vlan X): IP" - Show VLAN IDs for both nested and standalone virtual interfaces This ensures virtual interfaces (docker0, tailscale0, etc.) are properly nested under the primary physical NIC, and VLAN interfaces show their IDs. Updated to version 0.1.170	2025-11-27 09:52:45 +01:00
Christoffer Martinsson	fc247bd0ad	Create dedicated network collector with physical/virtual interface grouping Move network collection from NixOS collector to dedicated NetworkCollector. Add link status detection for physical interfaces (up/down). Group interfaces by physical/virtual, show status icons for physical NICs only. Down interfaces show as Inactive instead of Critical. Version bump to 0.1.165	2025-11-26 19:02:50 +01:00
Christoffer Martinsson	b7ffeaced5	Add network interface collection and display Extend NixOS collector to gather network interfaces using ip command JSON output. Display all interfaces with IPv4 and IPv6 addresses in Network section above CPU metrics. Filters out loopback and link-local addresses. Version bump to 0.1.161	2025-11-26 17:41:35 +01:00
Christoffer Martinsson	dc1105eefe	Display disk serial numbers instead of device names - Add serial_number field to DriveData structure - Collect serial numbers from SMART data for all drives - Display truncated serial numbers (last 8 chars) in dashboard - Fix parity drive label to show status icon before "Parity:" - Fix mount point label styling to match other labels	2025-11-25 11:06:54 +01:00
Christoffer Martinsson	c9d12793ef	Replace device names with serial numbers in MergerFS pool display Updates disk collector and dashboard to show drive serial numbers instead of device names (sdX) for MergerFS data/parity drives. Agent extracts serial numbers from SMART data and dashboard displays them when available, falling back to device names.	2025-11-25 10:30:37 +01:00
Christoffer Martinsson	8f80015273	Fix dashboard storage pool label styling Replace non-existent Typography::primary() with Typography::secondary() for MergerFS pool labels following existing UI patterns.	2025-11-25 10:16:26 +01:00
Christoffer Martinsson	67b59e9551	Simplify backup timestamp display with raw TOML format and remove spacing Replace timestamp parsing with direct display of start_time from backup TOML file to ensure timestamp always appears regardless of format. Remove empty line spacing above backup section for compact layout. Changes: - Remove parsed timestamp fields and use raw start_time string from TOML - Display backup time directly from TOML file without parsing - Remove blank line above backup section for tighter layout - Simplify BackupData structure by removing last_run and next_scheduled fields Version bump to v0.1.150	2025-11-25 00:08:36 +01:00
Christoffer Martinsson	da37e28b6a	Integrate backup metrics into system widget with enhanced disk monitoring Replace standalone backup widget with compact backup section in system widget displaying disk serial, temperature, wear level, timing, and usage information. Changes: - Remove standalone backup widget and integrate into system widget - Update backup collector to read TOML format from backup script - Add BackupDiskData structure with serial, usage, temperature, wear fields - Implement compact backup display matching specification format - Add time formatting utilities for backup timing display - Update backup data extraction from TOML with disk space parsing Version bump to v0.1.149	2025-11-24 23:55:35 +01:00
Christoffer Martinsson	75ec190b93	Fix service status icon mismatch with single source of truth architecture - Remove duplicate status string fields from ServiceData and SubServiceData - Use only Status enum as single source of truth for service status - Agent calculates Status enum using calculate_service_status() - Dashboard converts Status enum to display text for UI - Implement flexible metrics system for sub-services with label/value/unit - Fix status icon/text mismatches (inactive services now show gray circles) - Ensure perfect alignment between service icons and status text	2025-11-24 22:43:22 +01:00
Christoffer Martinsson	d164c1da5f	Add missing service_status field to ServiceData	2025-11-24 21:20:09 +01:00
Christoffer Martinsson	66ab7a492d	Complete monitoring system restoration Fully restored CM Dashboard as a complete monitoring system with working status evaluation and email notifications. COMPLETED PHASES: ✅ Phase 1: Fixed storage display issues - Use lsblk instead of findmnt (eliminates /nix/store bind mount) - Fixed NVMe SMART parsing (Temperature: and Percentage Used:) - Added sudo to smartctl for permissions - Consistent filesystem and tmpfs sorting ✅ Phase 2a: Fixed missing NixOS build information - Added build_version field to AgentData - NixOS collector now populates build info - Dashboard shows actual build instead of "unknown" ✅ Phase 2b: Restored status evaluation system - Added status fields to all structured data types - CPU: load and temperature status evaluation - Memory: usage status evaluation - Storage: temperature, health, and filesystem usage status - All collectors now use their threshold configurations ✅ Phase 3: Restored notification system - Status change detection between collection cycles - Email alerts on status degradation (OK→Warning/Critical) - Detailed notification content with metric values - Full NotificationManager integration CORE FUNCTIONALITY RESTORED: - Real-time monitoring with proper status evaluation - Email notifications on threshold violations - Correct storage display (nvme0n1 T: 28°C W: 1%) - Complete status-aware infrastructure monitoring - Dashboard is now a monitoring system, not just data viewer The CM Dashboard monitoring system is fully operational.	2025-11-24 19:58:26 +01:00
Christoffer Martinsson	adf3b0f51c	Implement complete structured data architecture Replace fragile string-based metrics with type-safe JSON data structures. Agent converts all metrics to structured data, dashboard processes typed fields. Changes: - Add AgentData struct with CPU, memory, storage, services, backup fields - Replace string parsing with direct field access throughout system - Maintain UI compatibility via temporary metric bridge conversion - Fix NVMe temperature display and eliminate string parsing bugs - Update protocol to support structured data transmission over ZMQ - Comprehensive metric type coverage: CPU, memory, storage, services, backup Version bump to 0.1.131	2025-11-23 21:32:00 +01:00
Christoffer Martinsson	156d707377	Add version display and fix status aggregation priorities - Add dynamic version display in top bar using CARGO_PKG_VERSION - Rewrite status aggregation to only show Critical/Warning/OK in top bar - Fix Status enum ordering to prioritize OK over transitional states - Remove blue/gray colors from top bar background	2025-11-21 16:19:45 +01:00
Christoffer Martinsson	9575077045	Fix Status::Inactive aggregation priority for green title bar - Move Status::Inactive to lowest priority in enum (before Ok) - Status aggregation now prefers Ok over Inactive in mixed scenarios - Title bar stays green when mixing active and inactive services - Inactive services still show gray icons but don't affect overall status - Ensures healthy systems with stopped services maintain green status	2025-11-18 18:17:25 +01:00
Christoffer Martinsson	34a1f7b9dc	Fix Status::Inactive ordering to prevent gray title bar - Reorder Status enum variants to fix aggregation priority - Status::Inactive now has same priority as Status::Ok in aggregation - Prevents inactive services from causing gray title bar - Title bar stays green when system has only active and inactive services - Only Unknown/Offline/Pending/Warning/Critical statuses affect title color	2025-11-18 18:03:50 +01:00
Christoffer Martinsson	d11aa11f99	Add Status::Inactive for inactive services with empty circle display - Add new Status::Inactive variant to enum for better service state representation - Agent now assigns Status::Inactive instead of Status::Warning for inactive services - Dashboard displays inactive services with empty circle (○) icon in gray color - User-stopped services still show as Status::Ok with green filled circle - Inactive services treated as OK for host status aggregation - Improves visual clarity between active (●), inactive (○), and warning (◐) states	2025-11-18 17:54:51 +01:00
Christoffer Martinsson	6179bd51a7	Implement WakeOnLAN functionality with simplified configuration - Add Status::Offline enum variant for disconnected hosts - All configured hosts now always visible showing offline status when disconnected - Add WakeOnLAN support using wake-on-lan Rust crate - Implement w key binding to wake offline hosts with MAC addresses - Simplify configuration to single [hosts] section with MAC addresses only - Change critical status icon from ◯ to ! for better visibility - Add proper MAC address parsing and error handling - Silent WakeOnLAN operation with logging for success/failure Configuration format: [hosts] hostname = { mac_address = "AA:BB:CC:DD:EE:FF" }	2025-10-31 09:03:01 +01:00
Christoffer Martinsson	b6da71b7e7	Implement real-time terminal popup for system rebuild operations - Add terminal popup UI component with 80% screen coverage and terminal styling - Extend ZMQ protocol with CommandOutputMessage for streaming output - Implement real-time output streaming in agent system rebuild handler - Add keyboard controls (ESC/Q to close, ↑↓ to scroll) for popup interaction - Fix system panel Build display to show actual NixOS build instead of config hash - Update service filters in README with wildcard patterns for better matching - Add periodic progress updates during nixos-rebuild execution - Integrate command output handling in dashboard main loop	2025-10-26 11:39:03 +01:00
Christoffer Martinsson	a08670071c	Implement simple persistent cache with automatic saving on status changes	2025-10-21 20:12:19 +02:00
Christoffer Martinsson	d80f2ce811	Remove unused cache tiers system	2025-10-21 18:43:46 +02:00
Christoffer Martinsson	41208aa2a0	Implement status aggregation with notification batching	2025-10-21 18:12:42 +02:00
Christoffer Martinsson	00a8ed3da2	Implement hysteresis for metric status changes to prevent flapping Add comprehensive hysteresis support to prevent status oscillation near threshold boundaries while maintaining responsive alerting. Key Features: - HysteresisThresholds with configurable upper/lower limits - StatusTracker for per-metric status history - Default gaps: CPU load 10%, memory 5%, disk temp 5°C Updated Components: - CPU load collector (5-minute average with hysteresis) - Memory usage collector (percentage-based thresholds) - Disk temperature collector (SMART data monitoring) - All collectors updated to support StatusTracker interface Cache Interval Adjustments: - Service status: 60s → 10s (faster response) - Disk usage: 300s → 60s (more frequent checks) - Backup status: 900s → 60s (quicker updates) - SMART data: moved to 600s tier (10 minutes) Architecture: - Individual metric status calculation in collectors - Centralized StatusTracker in MetricCollectionManager - Status aggregation preserved in dashboard widgets	2025-10-20 18:45:41 +02:00
Christoffer Martinsson	7f85a6436e	Clean up unused imports and fix build warnings - Remove unused imports (Duration, HashMap, SharedError, DateTime, etc.) - Fix unused variables by prefixing with underscore - Remove redundant dashboard.toml config file - Update theme imports to use only needed components - Maintain all functionality while reducing warnings - Add srv02 to predefined hosts configuration - Remove unused broadcast_command methods	2025-10-18 23:12:07 +02:00
Christoffer Martinsson	dcca5bbea3	Fix cache tier test to match actual configuration - Update test expectations from 5s to 2s intervals for realtime tier - Fix comment to reflect actual 2s interval instead of outdated 5s reference - All tests now pass correctly	2025-10-18 18:44:13 +02:00
Christoffer Martinsson	8a36472a3d	Implement real-time process monitoring and fix UI hardcoded data This commit addresses several key issues identified during development: Major Changes: - Replace hardcoded top CPU/RAM process display with real system data - Add intelligent process monitoring to CpuCollector using ps command - Fix disk metrics permission issues in systemd collector - Optimize service collection to focus on status, memory, and disk only - Update dashboard widgets to display live process information Process Monitoring Implementation: - Added collect_top_cpu_process() and collect_top_ram_process() methods - Implemented ps-based monitoring with accurate CPU percentages - Added filtering to prevent self-monitoring artifacts (ps commands) - Enhanced error handling and validation for process data - Dashboard now shows realistic values like "claude (PID 2974) 11.0%" Service Collection Optimization: - Removed CPU monitoring from systemd collector for efficiency - Enhanced service directory permission error logging - Simplified services widget to show essential metrics only - Fixed service-to-directory mapping accuracy UI and Dashboard Improvements: - Reorganized dashboard layout with btop-inspired multi-panel design - Updated system panel to include real top CPU/RAM process display - Enhanced widget formatting and data presentation - Removed placeholder/hardcoded data throughout the interface Technical Details: - Updated agent/src/collectors/cpu.rs with process monitoring - Modified dashboard/src/ui/mod.rs for real-time process display - Enhanced systemd collector error handling and disk metrics - Updated CLAUDE.md documentation with implementation details	2025-10-16 23:55:05 +02:00
Christoffer Martinsson	1b572c5c1d	Implement intelligent caching system for optimal CPU performance Replace traditional 5-second polling with tiered collection strategy: - RealTime (5s): CPU load, memory usage - Medium (5min): Service status, disk usage - Slow (15min): SMART data, backup status Key improvements: - Reduce CPU usage from 9.5% to <2% - Cache warming for instant dashboard responsiveness - Background refresh at 80% of tier intervals - Thread-safe cache with automatic cleanup Remove legacy polling code - smart caching is now the default and only mode. Agent startup enhanced with parallel cache population for immediate data availability. Architecture: SmartCache + CachedCollector + tiered CollectionScheduler	2025-10-15 11:21:36 +02:00
Christoffer Martinsson	57b676ad25	Testing	2025-10-13 00:16:24 +02:00
Christoffer Martinsson	2581435b10	Implement per-service disk usage monitoring Replaced system-wide disk usage with accurate per-service tracking by scanning service-specific directories. Services like sshd now correctly show minimal disk usage instead of misleading system totals. - Rename storage widget and add drive capacity/usage columns - Move host display to main dashboard title for cleaner layout - Replace separate alert displays with color-coded row highlighting - Add per-service disk usage collection using du command - Update services widget formatting to handle small disk values - Restructure into workspace with dedicated agent and dashboard packages	2025-10-11 22:59:16 +02:00
Christoffer Martinsson	82afe3d4f1	Restructure into workspace with dashboard and agent	2025-10-11 14:19:05 +02:00

38 Commits