cm-dashboard

Author	SHA1	Message	Date
Christoffer Martinsson	620d1f10b6	Show archive count per service instead of total sum	2025-11-29 17:51:01 +01:00
Christoffer Martinsson	f5913dbd43	Add archive count to backup disk display	2025-11-29 17:41:11 +01:00
Christoffer Martinsson	afb8d68e03	Implement multi-disk backup support - Update BackupData structure to support multiple backup disks - Scan /var/lib/backup/status/ directory for all status files - Calculate status icons for backup and disk usage - Aggregate repository status from all disks - Update dashboard to display all backup disks with per-disk status - Display repository list with count and aggregated status	2025-11-29 16:44:50 +01:00
Christoffer Martinsson	5e08b34280	Move C-state name cleaning to agent for smaller JSON All checks were successful Build and Release / build-and-release (push) Successful in 1m32s Details - Agent now extracts "C" + digits pattern (C3, C10) using char parsing - Removes suffixes like "_ACPI", "_MWAIT" at source - Reduces JSON payload size over ZMQ - No regex dependency - uses fast char iteration (~1μs overhead) - Robust fallback to original name if pattern not found - Dashboard simplified to use clean names directly Bump version to v0.1.212 Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-29 14:05:55 +01:00
Christoffer Martinsson	d84690cb3b	Move transmission interval to ZMQ config section All checks were successful Build and Release / build-and-release (push) Successful in 1m43s Details - Changed code to use zmq.transmission_interval_seconds instead of top-level collection_interval_seconds - Removed collection_interval_seconds from AgentConfig - Updated validation to check zmq.transmission_interval_seconds - Improves config organization by grouping all ZMQ settings together Bump version to v0.1.210 Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-29 13:31:39 +01:00
Christoffer Martinsson	7c030b33d6	Show top 3 C-states with usage percentages All checks were successful Build and Release / build-and-release (push) Successful in 1m21s Details - Changed CpuData.cstate from String to Vec<CStateInfo> - Added CStateInfo struct with name and percent fields - Collector calculates percentage for each C-state based on accumulated time - Sorts and returns top 3 C-states by usage - Dashboard displays: "C10:79% C8:10% C6:8%" Provides better visibility into CPU idle state distribution. Bump version to v0.1.209 Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-28 23:45:46 +01:00
Christoffer Martinsson	c6817537a8	Replace CPU frequency with C-state monitoring All checks were successful Build and Release / build-and-release (push) Successful in 1m20s Details - Changed CpuData.frequency_mhz to CpuData.cstate (String) - Implemented collect_cstate() to read CPU idle depth from sysfs - Finds deepest C-state with most accumulated time (C0-C10) - Updated dashboard to display C-state instead of frequency - More accurate indicator of CPU activity vs power management Bump version to v0.1.208 Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-28 23:30:14 +01:00
Christoffer Martinsson	28cfd5758f	Fix service metrics not showing - remove cache check The service_status_cache from discovery only has active_state with all detailed metrics set to None. During collection, get_service_status() was returning cached data instead of fetching fresh systemctl show data. Now always fetch fresh data to populate memory_bytes, restart_count, and uptime_seconds properly.	2025-11-28 23:15:51 +01:00
Christoffer Martinsson	0e01813ff5	Add service metrics from systemctl (memory, uptime, restarts) Shared: - Add memory_bytes, restart_count, uptime_seconds to ServiceData Agent: - Add new fields to ServiceStatusInfo struct - Fetch MemoryCurrent, NRestarts, ExecMainStartTimestamp from systemctl show - Calculate uptime from start timestamp - Parse and populate new fields in ServiceData - Remove unused load_state and sub_state fields Dashboard: - Add memory_bytes, restart_count, uptime_seconds to ServiceInfo - Update header: Service, Status, RAM, Uptime, ↻ (restarts) - Format memory as MB/GB - Format uptime as Xd Xh, Xh Xm, or Xm - Show restart count with ! prefix if > 0 to indicate instability All metrics obtained from single systemctl show call - zero overhead.	2025-11-28 23:06:13 +01:00
Christoffer Martinsson	67b686f8c7	Remove RAM and disk collection for services Complete removal of service resource metrics: Agent: - Remove memory_mb and disk_gb fields from ServiceData struct - Remove get_service_memory_usage() method - Remove get_service_disk_usage() method - Remove get_directory_size() method - Remove unused warn import Dashboard: - Remove memory_mb and disk_gb from ServiceInfo struct - Remove memory/disk display from format_parent_service_line - Remove memory/disk parsing in legacy metric path - Remove unused format_disk_size() function Service resource metrics were slow, unreliable, and never worked properly since structured data migration. Will be handled differently in the future.	2025-11-28 14:25:12 +01:00
Christoffer Martinsson	e3996fdb84	Fix compilation errors from command receiver removal All checks were successful Build and Release / build-and-release (push) Successful in 1m8s Details - Remove AgentCommand import from agent.rs - Remove handle_commands() method - Remove command handling from main loop - Remove command_port validation checks	2025-11-28 13:01:36 +01:00
Christoffer Martinsson	c19ff56df8	Remove unused ZMQ command receiver (port 6131) Service control migrated to SSH, command receiver no longer needed. - Remove command_receiver Socket from ZmqHandler - Remove try_receive_command method - Remove AgentCommand enum - Remove command_port from ZmqConfig	2025-11-28 12:52:43 +01:00
Christoffer Martinsson	2f94a4b853	Add service_type field to separate data from presentation Changes: - Add service_type field to SubServiceData: 'nginx_site', 'container', 'image' - Agent sends pure data without display formatting - Dashboard checks service_type to decide presentation - Docker images now display without status icon (service_type='image') - Remove unused image_size_str from docker images tuple Clean separation: agent provides data, dashboard handles display logic.	2025-11-27 18:09:20 +01:00
Christoffer Martinsson	fac0188c6f	Change docker image display format and status Changes: - Rename docker images from 'image_node:18...' to 'I node:18...' for conciseness - Change image status from 'active' to 'inactive' for neutral informational display - Images now show with gray empty circle ○ instead of green filled circle ● Docker images are static artifacts without meaningful operational status, so using inactive status provides neutral gray display that won't trigger alerts or affect service status aggregation.	2025-11-27 17:57:24 +01:00
Christoffer Martinsson	374b126446	Reduce all command timeouts to 2-3 seconds max With 10-second host heartbeat timeout, all command timeouts must be significantly lower to ensure total collection time stays under 10 seconds. Changed timeouts: - smartctl: 10s → 3s (critical: multiple drives queried sequentially) - du: 5s → 2s - lsblk: 5s → 2s - systemctl list commands: 5s → 3s - systemctl show/is-active: 3s → 2s - docker commands: 5s → 3s - df, ip commands: 3s → 2s Total worst-case collection time now capped at more reasonable levels, preventing false host offline alerts from blocking operations.	2025-11-27 16:38:54 +01:00
Christoffer Martinsson	1e0510be81	Add comprehensive timeouts to all blocking system commands Fixes random host disconnections caused by blocking operations preventing timely ZMQ packet transmission. Changes: - Add run_command_with_timeout() wrapper using tokio for async command execution - Apply 10s timeout to smartctl (prevents 30+ second hangs on failing drives) - Apply 5s timeout to du, lsblk, systemctl list commands - Apply 3s timeout to systemctl show/is-active, df, ip commands - Apply 2s timeout to hostname command - Use system 'timeout' command for sync operations where async not needed Critical fixes: - smartctl: Failing drives could block for 30+ seconds per drive - du: Large directories (Docker, PostgreSQL) could block 10-30+ seconds - systemctl/docker: Commands could block indefinitely during system issues With 1-second collection interval and 10-second heartbeat timeout, any blocking operation >10s causes false "host offline" alerts. These timeouts ensure collection completes quickly even during system degradation.	2025-11-27 16:34:08 +01:00
Christoffer Martinsson	6d6beb207d	Parse Docker image sizes to MB and sort services alphabetically All checks were successful Build and Release / build-and-release (push) Successful in 1m18s Details	2025-11-27 15:57:38 +01:00
Christoffer Martinsson	7a68da01f5	Remove debug logging for NVMe SMART collection All checks were successful Build and Release / build-and-release (push) Successful in 1m9s Details	2025-11-27 15:40:16 +01:00
Christoffer Martinsson	5be67fed64	Add debug logging for NVMe SMART data collection All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details	2025-11-27 15:00:48 +01:00
Christoffer Martinsson	cac836601b	Add NVMe device type flag for SMART data collection All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details	2025-11-27 13:34:30 +01:00
Christoffer Martinsson	bd22ce265b	Use direct smartctl with CAP_SYS_RAWIO instead of sudo All checks were successful Build and Release / build-and-release (push) Successful in 1m9s Details	2025-11-27 13:22:13 +01:00
Christoffer Martinsson	bbc8b7b1cb	Add info-level logging for SMART data collection debugging All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details	2025-11-27 13:15:53 +01:00
Christoffer Martinsson	5dd8cadef3	Remove debug logging from Docker collection code All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details	2025-11-27 12:50:20 +01:00
Christoffer Martinsson	fefe30ec51	Remove sudo from docker commands - use docker group membership instead All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details Agent changes: - Changed docker ps and docker images commands to run without sudo - cm-agent user is already in docker group, so sudo is not needed - Fixes "unable to change to root gid: Operation not permitted" error - Systemd security restrictions were blocking sudo gid changes This fixes Docker container and image collection on systems with systemd security hardening enabled. Updated to version 0.1.178	2025-11-27 12:35:38 +01:00
Christoffer Martinsson	fb40cce748	Add stderr logging for Docker images command failure All checks were successful Build and Release / build-and-release (push) Successful in 1m9s Details Agent changes: - Log stderr output when docker images command fails - This will show the actual error message (e.g., permission denied, docker not found) - Helps diagnose why docker images collection is failing Updated to version 0.1.177	2025-11-27 12:28:55 +01:00
Christoffer Martinsson	eaa057b284	Change Docker collection logging from debug to info level All checks were successful Build and Release / build-and-release (push) Successful in 1m10s Details Agent changes: - Changed debug!() to info!() for Docker collection logs - This allows logs to show with default RUST_LOG=info setting - Added info import to tracing use statement Now logs will be visible in journalctl without needing to change log level: - "Collecting Docker sub-services for service: docker" - "Found X Docker containers" - "Found X Docker images" - "Total Docker sub-services added: X" Updated to version 0.1.176	2025-11-27 12:18:17 +01:00
Christoffer Martinsson	f23a1b5cec	Add debug logging for Docker container and image collection All checks were successful Build and Release / build-and-release (push) Successful in 1m10s Details Agent changes: - Added debug logging to Docker images collection function - Log when Docker sub-services are being collected for a service - Log count of containers and images found - Log total sub-services added - Show command failure details instead of silently returning empty vec This will help diagnose why Docker images aren't showing up as sub-services on some hosts. The logs will show if the docker commands are failing or if the collection is working but data isn't being transmitted properly. Updated to version 0.1.175	2025-11-27 12:04:51 +01:00
Christoffer Martinsson	3f98f68b51	Show Docker images as sub-services under docker service All checks were successful Build and Release / build-and-release (push) Successful in 1m23s Details Agent changes: - Added get_docker_images() function to list all Docker images - Use docker images to show stored images with repository:tag and size - Display images as sub-services under docker service with size in parentheses - Skip dangling images (<none>:<none>) - Images shown with active status (always present when listed) Example display: ● docker active 139M 1MB ├─ ● docker_gitea active ├─ ○ docker_old-app inactive ├─ ● image_nginx:latest (142MB) ├─ ● image_postgres:15 (379MB) └─ ● image_gitea:latest (256MB) Updated to version 0.1.174	2025-11-27 11:43:35 +01:00
Christoffer Martinsson	3d38a7a984	Show all Docker containers as sub-services with active/inactive status All checks were successful Build and Release / build-and-release (push) Successful in 1m9s Details Agent changes: - Use docker ps -a to show ALL containers (running and stopped) - Map container status: Up -> active, Exited/Created -> inactive, other -> failed - Display Docker containers as sub-services under the docker service - Each container shown with proper status indicator Example display: ● docker active 139M 1MB ├─ ● docker_gitea active ├─ ○ docker_old-app inactive └─ ● docker_immich active Updated to version 0.1.173	2025-11-27 10:56:15 +01:00
Christoffer Martinsson	b0ee0242bd	Show all Docker containers as top-level services with active/inactive status All checks were successful Build and Release / build-and-release (push) Successful in 1m20s Details Agent changes: - Changed docker ps to docker ps -a to show ALL containers (running and stopped) - Map container status: Up -> active, Exited/Created -> inactive, other -> failed - Display Docker containers as individual top-level services instead of sub-services - Each container shown as "docker_{container_name}" in service list This provides better visibility of all containers and their status directly in the services panel, making it easier to see stopped containers at a glance. Updated to version 0.1.172	2025-11-27 10:51:47 +01:00
Christoffer Martinsson	937f4ad427	Add VLAN ID display and smart parent assignment for virtual interfaces All checks were successful Build and Release / build-and-release (push) Successful in 1m43s Details Agent changes: - Parse /proc/net/vlan/config to extract VLAN IDs for interfaces - Detect primary physical interface via default route - Auto-assign primary interface as parent for virtual interfaces without explicit parent - Added vlan_id field to NetworkInterfaceData Dashboard changes: - Display VLAN ID in format "interface (vlan X): IP" - Show VLAN IDs for both nested and standalone virtual interfaces This ensures virtual interfaces (docker0, tailscale0, etc.) are properly nested under the primary physical NIC, and VLAN interfaces show their IDs. Updated to version 0.1.170	2025-11-27 09:52:45 +01:00
Christoffer Martinsson	8aefab83ae	Fix network interface display for VLANs and physical NICs All checks were successful Build and Release / build-and-release (push) Successful in 1m11s Details Agent changes: - Filter out ifb* interfaces from network display - Parse @parent notation for VLAN interfaces (e.g., lan@enp0s31f6) - Show physical interfaces even without IP addresses - Only filter virtual interfaces that have no IPs - Extract parent interface relationships for proper nesting Dashboard changes: - Nest VLAN/child interfaces under their physical parent - Show physical NICs with status icons even when down - Display child interfaces grouped under parent interface - Keep standalone virtual interfaces at root level Updated to version 0.1.169	2025-11-26 23:47:16 +01:00
Christoffer Martinsson	5c6b11c794	Filter out network interfaces without IP addresses All checks were successful Build and Release / build-and-release (push) Successful in 1m9s Details Remove interfaces like ifb0, dummy devices that have no IPs. Only show interfaces with at least one IPv4 or IPv6 address. Version bump to 0.1.167	2025-11-26 19:19:21 +01:00
Christoffer Martinsson	fc247bd0ad	Create dedicated network collector with physical/virtual interface grouping All checks were successful Build and Release / build-and-release (push) Successful in 1m43s Details Move network collection from NixOS collector to dedicated NetworkCollector. Add link status detection for physical interfaces (up/down). Group interfaces by physical/virtual, show status icons for physical NICs only. Down interfaces show as Inactive instead of Critical. Version bump to 0.1.165	2025-11-26 19:02:50 +01:00
Christoffer Martinsson	b7ffeaced5	Add network interface collection and display Some checks failed Build and Release / build-and-release (push) Failing after 1m32s Details Extend NixOS collector to gather network interfaces using ip command JSON output. Display all interfaces with IPv4 and IPv6 addresses in Network section above CPU metrics. Filters out loopback and link-local addresses. Version bump to 0.1.161	2025-11-26 17:41:35 +01:00
Christoffer Martinsson	3858309a5d	Fix Docker container detection with sudo permissions Some checks failed Build and Release / build-and-release (push) Failing after 1m19s Details Update systemd collector to use sudo for docker ps command to resolve permission issues when cm-agent user lacks docker group membership. This ensures Docker containers are properly discovered and displayed as sub-services under the docker service. Version: 0.1.160	2025-11-25 12:40:27 +01:00
Christoffer Martinsson	df104bf940	Remove debug prints and unused code All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details - Remove all debug println statements - Remove unused service_tracker module - Remove unused struct fields and methods - Remove empty placeholder files (cpu.rs, memory.rs, defaults.rs) - Fix all compiler warnings - Clean build with zero warnings Version bump to 0.1.159	2025-11-25 12:19:04 +01:00
Christoffer Martinsson	d5ce36ee18	Add support for additional SMART attributes All checks were successful Build and Release / build-and-release (push) Successful in 1m30s Details - Support Temperature_Case attribute for Intel SSDs - Support Media_Wearout_Indicator attribute for wear percentage - Parse wear value from column 3 (VALUE) for Media_Wearout_Indicator - Fixes temperature and wear display for Intel PHLA847000FL512DGN drives	2025-11-25 11:53:08 +01:00
Christoffer Martinsson	4f80701671	Fix NVMe serial display and improve pool health logic All checks were successful Build and Release / build-and-release (push) Successful in 1m20s Details - Fix physical drive serial number display in dashboard - Improve pool health calculation for arrays with multiple disks - Support proper tree symbols for multiple parity drives - Read git commit hash from /var/lib/cm-dashboard/git-commit for Build display	2025-11-25 11:44:20 +01:00
Christoffer Martinsson	267654fda4	Improve NVMe serial parsing and restructure MergerFS display All checks were successful Build and Release / build-and-release (push) Successful in 1m25s Details - Fix NVMe serial number parsing to handle whitespace variations - Move mount point to MergerFS header, remove drive count - Restructure data drives to same level as parity with Data_1, Data_2 labels - Remove "Total:" label from pool usage line - Update parity to use closing tree symbol as last item	2025-11-25 11:28:54 +01:00
Christoffer Martinsson	dc1105eefe	Display disk serial numbers instead of device names All checks were successful Build and Release / build-and-release (push) Successful in 1m18s Details - Add serial_number field to DriveData structure - Collect serial numbers from SMART data for all drives - Display truncated serial numbers (last 8 chars) in dashboard - Fix parity drive label to show status icon before "Parity:" - Fix mount point label styling to match other labels	2025-11-25 11:06:54 +01:00
Christoffer Martinsson	c9d12793ef	Replace device names with serial numbers in MergerFS pool display All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details Updates disk collector and dashboard to show drive serial numbers instead of device names (sdX) for MergerFS data/parity drives. Agent extracts serial numbers from SMART data and dashboard displays them when available, falling back to device names.	2025-11-25 10:30:37 +01:00
Christoffer Martinsson	8f80015273	Fix dashboard storage pool label styling All checks were successful Build and Release / build-and-release (push) Successful in 1m20s Details Replace non-existent Typography::primary() with Typography::secondary() for MergerFS pool labels following existing UI patterns.	2025-11-25 10:16:26 +01:00
Christoffer Martinsson	7b11db990c	Restore complete MergerFS and SnapRAID functionality to disk collector All checks were successful Build and Release / build-and-release (push) Successful in 1m17s Details Updated the disk collector to include all missing functionality from the previous string-based implementation while working with the new structured JSON data architecture: - MergerFS pool discovery from /proc/mounts parsing - SnapRAID parity drive detection via mount path heuristics - Drive categorization (data vs parity) based on path analysis - Numeric mergerfs reference resolution (1:2 -> /mnt/disk paths) - Pool health calculation based on member drive SMART status - Complete SMART data integration for temperatures and wear levels - Proper exclusion of pool member drives from physical drive grouping The implementation replicates the exact logic from the old code while adapting to structured AgentData output format. All mergerfs and snapraid monitoring capabilities are fully restored.	2025-11-25 08:37:32 +01:00
Christoffer Martinsson	67b59e9551	Simplify backup timestamp display with raw TOML format and remove spacing All checks were successful Build and Release / build-and-release (push) Successful in 1m41s Details Replace timestamp parsing with direct display of start_time from backup TOML file to ensure timestamp always appears regardless of format. Remove empty line spacing above backup section for compact layout. Changes: - Remove parsed timestamp fields and use raw start_time string from TOML - Display backup time directly from TOML file without parsing - Remove blank line above backup section for tighter layout - Simplify BackupData structure by removing last_run and next_scheduled fields Version bump to v0.1.150	2025-11-25 00:08:36 +01:00
Christoffer Martinsson	da37e28b6a	Integrate backup metrics into system widget with enhanced disk monitoring All checks were successful Build and Release / build-and-release (push) Successful in 2m5s Details Replace standalone backup widget with compact backup section in system widget displaying disk serial, temperature, wear level, timing, and usage information. Changes: - Remove standalone backup widget and integrate into system widget - Update backup collector to read TOML format from backup script - Add BackupDiskData structure with serial, usage, temperature, wear fields - Implement compact backup display matching specification format - Add time formatting utilities for backup timing display - Update backup data extraction from TOML with disk space parsing Version bump to v0.1.149	2025-11-24 23:55:35 +01:00
Christoffer Martinsson	d89b3ac881	Fix nginx sub-services persistent caching with complete service data storage All checks were successful Build and Release / build-and-release (push) Successful in 1m17s Details Resolves nginx sites appearing only briefly during collection cycles by implementing proper caching of complete service data including sub-services. Changes: - Add cached_service_data field to store complete ServiceData with sub-services - Modify collection logic to cache full service objects instead of basic ServiceInfo - Update cache retrieval to use complete cached data preserving nginx site metrics - Eliminate flickering of nginx sites between collection cycles Version bump to v0.1.148	2025-11-24 23:24:00 +01:00
Christoffer Martinsson	7f26991609	Fix nginx sub-services flickering with persistent caching All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details - Remove nginx_ prefix from site names in hierarchical structure - Fix get_nginx_site_metrics to call correct internal method - Implement same caching functionality as old working version - Sites now stay visible continuously with 30s latency updates - Preserve cached results between refresh cycles	2025-11-24 23:01:51 +01:00
Christoffer Martinsson	75ec190b93	Fix service status icon mismatch with single source of truth architecture All checks were successful Build and Release / build-and-release (push) Successful in 1m8s Details - Remove duplicate status string fields from ServiceData and SubServiceData - Use only Status enum as single source of truth for service status - Agent calculates Status enum using calculate_service_status() - Dashboard converts Status enum to display text for UI - Implement flexible metrics system for sub-services with label/value/unit - Fix status icon/text mismatches (inactive services now show gray circles) - Ensure perfect alignment between service icons and status text	2025-11-24 22:43:22 +01:00
Christoffer Martinsson	eb892096d9	Complete systemd collector restoration matching original architecture All checks were successful Build and Release / build-and-release (push) Successful in 2m8s Details - Add nginx site metrics caching with configurable intervals matching original - Implement complex nginx config parsing with brace counting and redirect detection - Replace curl with reqwest HTTP client for proper timeout and redirect handling - Fix docker container parsing to use comma format with proper status mapping - Add sudo to directory size command for permission handling - Change nginx URLs to use https protocol matching original - Add advanced NixOS ExecStart parsing for argv[] format support - Add nginx -T fallback functionality for config discovery - Implement proper server block parsing with domain validation and brace tracking - Add get_service_memory function matching original signature All functionality now matches pre-refactor implementation architecture.	2025-11-24 22:02:15 +01:00

1 2 3 4 5 ...

277 Commits