cm-dashboard

Author	SHA1	Message	Date
Christoffer Martinsson	d89b3ac881	Fix nginx sub-services persistent caching with complete service data storage All checks were successful Build and Release / build-and-release (push) Successful in 1m17s Details Resolves nginx sites appearing only briefly during collection cycles by implementing proper caching of complete service data including sub-services. Changes: - Add cached_service_data field to store complete ServiceData with sub-services - Modify collection logic to cache full service objects instead of basic ServiceInfo - Update cache retrieval to use complete cached data preserving nginx site metrics - Eliminate flickering of nginx sites between collection cycles Version bump to v0.1.148	2025-11-24 23:24:00 +01:00
Christoffer Martinsson	7f26991609	Fix nginx sub-services flickering with persistent caching All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details - Remove nginx_ prefix from site names in hierarchical structure - Fix get_nginx_site_metrics to call correct internal method - Implement same caching functionality as old working version - Sites now stay visible continuously with 30s latency updates - Preserve cached results between refresh cycles	2025-11-24 23:01:51 +01:00
Christoffer Martinsson	75ec190b93	Fix service status icon mismatch with single source of truth architecture All checks were successful Build and Release / build-and-release (push) Successful in 1m8s Details - Remove duplicate status string fields from ServiceData and SubServiceData - Use only Status enum as single source of truth for service status - Agent calculates Status enum using calculate_service_status() - Dashboard converts Status enum to display text for UI - Implement flexible metrics system for sub-services with label/value/unit - Fix status icon/text mismatches (inactive services now show gray circles) - Ensure perfect alignment between service icons and status text	2025-11-24 22:43:22 +01:00
Christoffer Martinsson	eb892096d9	Complete systemd collector restoration matching original architecture All checks were successful Build and Release / build-and-release (push) Successful in 2m8s Details - Add nginx site metrics caching with configurable intervals matching original - Implement complex nginx config parsing with brace counting and redirect detection - Replace curl with reqwest HTTP client for proper timeout and redirect handling - Fix docker container parsing to use comma format with proper status mapping - Add sudo to directory size command for permission handling - Change nginx URLs to use https protocol matching original - Add advanced NixOS ExecStart parsing for argv[] format support - Add nginx -T fallback functionality for config discovery - Implement proper server block parsing with domain validation and brace tracking - Add get_service_memory function matching original signature All functionality now matches pre-refactor implementation architecture.	2025-11-24 22:02:15 +01:00
Christoffer Martinsson	c006625a3f	Restore complete systemd collector functionality All checks were successful Build and Release / build-and-release (push) Successful in 2m7s Details - Enhanced directory size logic with minimum 0.001GB visibility and permission error logging - Added nginx site monitoring with latency checks and NixOS config discovery - Added docker container monitoring as sub-services - Integrated sub-service collection for active nginx and docker services - All missing features from original implementation now restored	2025-11-24 21:51:42 +01:00
Christoffer Martinsson	dcd5fff8c1	Update version to v0.1.143 All checks were successful Build and Release / build-and-release (push) Successful in 1m16s Details	2025-11-24 21:43:01 +01:00
Christoffer Martinsson	b120f95f8a	Restore service discovery and disk usage calculation Some checks failed Build and Release / build-and-release (push) Failing after 1m2s Details Fixes missing services and 0B disk usage issues by restoring: - Wildcard pattern matching for service filters (gitea, redis) - Service disk usage calculation from directories and WorkingDirectory - Proper Status::Inactive for inactive services Services now properly discovered and show actual disk usage.	2025-11-24 20:25:08 +01:00
Christoffer Martinsson	66ab7a492d	Complete monitoring system restoration All checks were successful Build and Release / build-and-release (push) Successful in 2m39s Details Fully restored CM Dashboard as a complete monitoring system with working status evaluation and email notifications. COMPLETED PHASES: ✅ Phase 1: Fixed storage display issues - Use lsblk instead of findmnt (eliminates /nix/store bind mount) - Fixed NVMe SMART parsing (Temperature: and Percentage Used:) - Added sudo to smartctl for permissions - Consistent filesystem and tmpfs sorting ✅ Phase 2a: Fixed missing NixOS build information - Added build_version field to AgentData - NixOS collector now populates build info - Dashboard shows actual build instead of "unknown" ✅ Phase 2b: Restored status evaluation system - Added status fields to all structured data types - CPU: load and temperature status evaluation - Memory: usage status evaluation - Storage: temperature, health, and filesystem usage status - All collectors now use their threshold configurations ✅ Phase 3: Restored notification system - Status change detection between collection cycles - Email alerts on status degradation (OK→Warning/Critical) - Detailed notification content with metric values - Full NotificationManager integration CORE FUNCTIONALITY RESTORED: - Real-time monitoring with proper status evaluation - Email notifications on threshold violations - Correct storage display (nvme0n1 T: 28°C W: 1%) - Complete status-aware infrastructure monitoring - Dashboard is now a monitoring system, not just data viewer The CM Dashboard monitoring system is fully operational.	2025-11-24 19:58:26 +01:00
Christoffer Martinsson	fd7ad23205	Fix storage display issues and use dynamic versioning All checks were successful Build and Release / build-and-release (push) Successful in 1m7s Details Phase 1 fixes for storage display: - Replace findmnt with lsblk to eliminate bind mount issues (/nix/store) - Add sudo to smartctl commands for permission access - Fix NVMe SMART parsing for Temperature: and Percentage Used: fields - Use dynamic version from CARGO_PKG_VERSION instead of hardcoded strings Storage display should now show correct mount points and temperature/wear. Status evaluation and notifications still need restoration in subsequent phases.	2025-11-24 19:26:09 +01:00
Christoffer Martinsson	2b2cb2da3e	Complete atomic migration to structured data architecture All checks were successful Build and Release / build-and-release (push) Successful in 1m7s Details Implements clean structured data collection eliminating all string metric parsing bugs. Collectors now populate AgentData directly with type-safe field access. Key improvements: - Mount points preserved correctly (/ and /boot instead of root/boot) - Tmpfs discovery added to memory collector - Temperature data flows as typed f32 fields - Zero string parsing overhead - Complete removal of MetricCollectionManager bridge - Direct ZMQ transmission of structured JSON All functionality maintained: service tracking, notifications, status evaluation, and multi-host monitoring.	2025-11-24 18:53:31 +01:00
Christoffer Martinsson	11d1c2dc94	Fix storage display format and clean up warnings All checks were successful Build and Release / build-and-release (push) Successful in 1m9s Details Update storage display to match CLAUDE.md specification: - Show drive temp/wear on main line: nvme0n1 T: 25°C W: 4% - Display individual filesystems as sub-items: /: 55% 250.5GB/456.4GB - Remove Total usage line in favor of filesystem breakdown Clean up code warnings: - Remove unused heartbeat methods and fields - Remove unused backup widget fields and methods - Add allow attributes for legacy methods	2025-11-24 16:03:31 +01:00
Christoffer Martinsson	bea2d120b5	Update storage display format to match CLAUDE.md specification All checks were successful Build and Release / build-and-release (push) Successful in 1m18s Details Remove parentheses from drive temperature/wear display to match the hierarchical format specified in documentation. Drive details now show directly with status icons as 'nvme0n1 T: 25°C W: 4%' format.	2025-11-24 15:21:58 +01:00
Christoffer Martinsson	5394164123	Remove agent heartbeat causing dashboard zero dropouts All checks were successful Build and Release / build-and-release (push) Successful in 1m9s Details Agent heartbeat was sending empty AgentData every few seconds, causing dashboard to display zero values for all metrics intermittently. Since agent already transmits complete data every 1 second, heartbeat is redundant. Dashboard will detect offline hosts via data timestamps.	2025-11-24 15:03:20 +01:00
Christoffer Martinsson	4329cd26e0	Make disk collector filesystems field optional for auto-discovery All checks were successful Build and Release / build-and-release (push) Successful in 1m32s Details Allow agent configuration without explicit filesystems list by making the field optional with serde default, enabling pure auto-discovery mode. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 13:47:53 +01:00
Christoffer Martinsson	b85bd6b153	Fix agent collector timing to prevent intermittent data gaps All checks were successful Build and Release / build-and-release (push) Successful in 1m42s Details Update last_collection timestamp even when collectors fail to prevent immediate retry loops that cause data transmission gaps every 5 seconds. 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-24 13:42:29 +01:00
Christoffer Martinsson	c9b2d5e342	Update version to v0.1.133 All checks were successful Build and Release / build-and-release (push) Successful in 2m9s Details Bump version across all workspace crates for next release including agent, dashboard, and shared components.	2025-11-23 22:25:19 +01:00
Christoffer Martinsson	b2b301332f	Fix storage display showing missing total usage data All checks were successful Build and Release / build-and-release (push) Successful in 2m10s Details The structured data bridge conversion was only converting individual drive metrics (temperature, wear) and filesystem metrics, but wasn't generating the aggregated total usage metrics expected by the storage widget (disk_{drive}_total_gb, disk_{drive}_used_gb, disk_{drive}_usage_percent). This caused physical drives to display "—% —GB/—GB" instead of actual usage statistics. Updated the bridge conversion to calculate drive totals by aggregating all filesystems on each drive: - total_used = sum of all filesystem used_gb values - total_size = sum of all filesystem total_gb values - average_usage = (total_used / total_size) * 100 Now physical drives like nvme0n1 properly display total usage aggregated from all their filesystems (e.g., /boot + / = total drive usage). Version bump: v0.1.131 → v0.1.132	2025-11-23 21:43:34 +01:00
Christoffer Martinsson	adf3b0f51c	Implement complete structured data architecture All checks were successful Build and Release / build-and-release (push) Successful in 2m10s Details Replace fragile string-based metrics with type-safe JSON data structures. Agent converts all metrics to structured data, dashboard processes typed fields. Changes: - Add AgentData struct with CPU, memory, storage, services, backup fields - Replace string parsing with direct field access throughout system - Maintain UI compatibility via temporary metric bridge conversion - Fix NVMe temperature display and eliminate string parsing bugs - Update protocol to support structured data transmission over ZMQ - Comprehensive metric type coverage: CPU, memory, storage, services, backup Version bump to 0.1.131	2025-11-23 21:32:00 +01:00
Christoffer Martinsson	41ded0170c	Add wear percentage display and NVMe temperature collection All checks were successful Build and Release / build-and-release (push) Successful in 2m9s Details - Display wear percentage in storage headers for single physical drives - Remove redundant drive type indicators, show wear data instead - Fix wear metric parsing for physical drives (underscore count issue) - Add NVMe temperature parsing support (Temperature: format) - Add raw metrics debugging functionality for troubleshooting - Clean up physical drive display to remove redundant information	2025-11-23 20:29:24 +01:00
Christoffer Martinsson	9b4191b2c3	Fix physical drive name and health status display All checks were successful Build and Release / build-and-release (push) Successful in 2m13s Details - Display actual drive name (e.g., nvme0n1) instead of mount point for physical drives - Fix health status parsing for physical drives to show proper status icons - Update pool name extraction to handle disk_{drive}_health metrics correctly - Improve storage widget rendering for physical drive identification	2025-11-23 19:25:45 +01:00
Christoffer Martinsson	53dbb43352	Fix SnapRAID parity association using directory-based discovery All checks were successful Build and Release / build-and-release (push) Successful in 1m8s Details - Replace blanket parity drive inclusion with smart relationship detection - Only associate parity drives from same parent directory as data drives - Prevent incorrect exclusion of nvme0n1 physical drives from grouping - Maintain zero-configuration auto-discovery without hardcoded paths	2025-11-23 18:42:48 +01:00
Christoffer Martinsson	ba03623110	Remove hardcoded pool mount point mappings for true auto-discovery All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details - Eliminate hardcoded mappings like 'root' -> '/' and 'steampool' -> '/mnt/steampool' - Use device names directly for physical drives - Rely on mount_point metrics from agent for actual mount paths - Implement zero-configuration architecture as specified in CLAUDE.md	2025-11-23 18:34:45 +01:00
Christoffer Martinsson	f24c4ed650	Fix pool name extraction to prevent wrong physical drive naming All checks were successful Build and Release / build-and-release (push) Successful in 2m10s Details - Remove fallback logic that could extract incorrect pool names - Simplify pool suffix matching to use explicit arrays - Ensure only valid metric patterns create pools	2025-11-23 18:24:39 +01:00
Christoffer Martinsson	86501fd486	Fix display format to match CLAUDE.md specification All checks were successful Build and Release / build-and-release (push) Successful in 1m17s Details - Use actual device names (sdb, sdc) instead of data_0, parity_0 - Fix physical drive naming to show device names instead of mount points - Update pool name extraction to handle new device-based naming - Ensure Drive: line shows temperature and wear data for physical drives	2025-11-23 18:13:35 +01:00
Christoffer Martinsson	192eea6e0c	Integrate SnapRAID parity drives into mergerfs pools All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details - Add SnapRAID parity drive detection to mergerfs discovery - Remove Pool Status health line as discussed - Update drive display to always show wear data when available - Include /mnt/parity drives as part of mergerfs pool structure	2025-11-23 18:05:19 +01:00
Christoffer Martinsson	43fb838c9b	Fix duplicate drive display in mergerfs pools All checks were successful Build and Release / build-and-release (push) Successful in 2m9s Details - Restructure storage rendering logic to prevent drive duplication - Use specific mergerfs check instead of generic multi-drive condition - Ensure drives only appear once under organized data/parity sections	2025-11-23 17:46:09 +01:00
Christoffer Martinsson	54483653f9	Fix mergerfs drive metric parsing for proper pool consolidation All checks were successful Build and Release / build-and-release (push) Successful in 2m11s Details - Update extract_pool_name to handle data_/parity_ drive metrics correctly - Fix extract_drive_name to parse mergerfs drive roles properly - Prevent srv_media_data from being parsed as separate pool	2025-11-23 17:40:12 +01:00
Christoffer Martinsson	e47803b705	Fix mergerfs pool consolidation and naming All checks were successful Build and Release / build-and-release (push) Successful in 1m18s Details - Improve pool name extraction in dashboard parsing - Use consistent mergerfs pool naming in agent - Add mount_point metric parsing to use actual mount paths - Fix pool consolidation to prevent duplicate entries	2025-11-23 17:35:23 +01:00
Christoffer Martinsson	439d0d9af6	Fix mergerfs numeric reference parsing for proper pool detection All checks were successful Build and Release / build-and-release (push) Successful in 2m11s Details Add support for numeric mergerfs references like "1:2" by mapping them to actual mount points (/mnt/disk1, /mnt/disk2). This enables proper mergerfs pool detection and hides individual member drives as intended.	2025-11-23 17:27:45 +01:00
Christoffer Martinsson	2242b5ddfe	Make mergerfs detection more robust to prevent discovery failures All checks were successful Build and Release / build-and-release (push) Successful in 2m9s Details Skip mergerfs pools with numeric device references (e.g., "1:2") instead of crashing. This allows regular drive detection to work even when mergerfs uses non-standard mount formats. Preserves existing functionality for standard mergerfs setups.	2025-11-23 17:19:15 +01:00
Christoffer Martinsson	9d0f42d55c	Fix filesystem usage_percent parsing and remove hardcoded status All checks were successful Build and Release / build-and-release (push) Successful in 1m8s Details 1. Add missing _fs_ filter to usage_percent parsing in dashboard 2. Fix agent to use calculated fs_status instead of hardcoded Status::Ok This completes the disk collector auto-discovery by ensuring filesystem usage percentages and status indicators display correctly.	2025-11-23 16:47:20 +01:00
Christoffer Martinsson	1da7b5f6e7	Fix both pool-level and filesystem metric parsing bugs All checks were successful Build and Release / build-and-release (push) Successful in 1m10s Details 1. Prevent filesystem _fs_ metrics from overwriting pool totals 2. Fix filesystem name extraction to properly parse boot/root names This resolves both the pool total display (showing 0.1GB instead of 220GB) and individual filesystem display (showing —% —GB/—GB).	2025-11-23 16:29:00 +01:00
Christoffer Martinsson	006f27f7d9	Fix lsblk parsing for filesystem discovery All checks were successful Build and Release / build-and-release (push) Successful in 1m9s Details Remove unused debug code and fix device name parsing to properly handle lsblk tree characters. This resolves the issue where only /boot filesystem was discovered instead of both /boot and /.	2025-11-23 16:09:48 +01:00
Christoffer Martinsson	07422cd0a7	Add debug logging for filesystem discovery All checks were successful Build and Release / build-and-release (push) Successful in 1m18s Details	2025-11-23 15:26:49 +01:00
Christoffer Martinsson	de30b80219	Fix filesystem metric parsing bounds error in dashboard All checks were successful Build and Release / build-and-release (push) Successful in 1m8s Details Prevent string slicing panic in extract_filesystem_metric when parsing individual filesystem metrics. This resolves the issue where filesystem entries show —% —GB/—GB instead of actual usage.	2025-11-23 15:23:15 +01:00
Christoffer Martinsson	7d96ca9fad	Fix disk collector filesystem discovery with debug logging All checks were successful Build and Release / build-and-release (push) Successful in 1m9s Details Add debug logging to filesystem usage collection to identify why some mount points are being dropped during discovery. This should resolve the issue where total capacity shows incorrect values.	2025-11-23 15:15:56 +01:00
Christoffer Martinsson	9b940ebd19	Fix string slicing bounds error in metric parsing All checks were successful Build and Release / build-and-release (push) Successful in 1m8s Details Fixed critical bug where dashboard crashed with 'begin <= end' slice error when parsing disk metrics with new naming format. Added bounds checking to prevent invalid string slicing operations. - Fixed extract_pool_name string slicing bounds check - Removed ineffective panic handling that caused infinite loop - Dashboard now handles new disk collector metrics correctly	2025-11-23 14:52:09 +01:00
Christoffer Martinsson	6d4da1b7da	Add robust error handling to prevent dashboard crashes All checks were successful Build and Release / build-and-release (push) Successful in 2m9s Details Added comprehensive error handling to storage metrics parsing to prevent dashboard crashes when encountering unexpected metric formats or parsing errors. Dashboard now continues gracefully with empty storage display instead of crashing, improving reliability during metric format changes. - Wrapped storage metric parsing in panic recovery - Added logging for metric parsing failures - Dashboard shows empty storage on errors instead of crashing - Ensures dashboard remains functional during agent updates	2025-11-23 14:45:00 +01:00
Christoffer Martinsson	1e7f1616aa	Complete disk collector rewrite with clean architecture All checks were successful Build and Release / build-and-release (push) Successful in 2m8s Details Replaced complex disk collector with simple lsblk → df → group workflow. Supports both physical drives and mergerfs pools with unified metrics. Eliminates configuration complexity through pure auto-discovery. - Clean discovery pipeline using lsblk and df commands - Physical drive grouping with filesystem children - MergerFS pool detection with parity heuristics - Unified metric generation for consistent dashboard display - SMART data collection for temperature, wear, and health	2025-11-23 14:22:19 +01:00
Christoffer Martinsson	7a3ee3d5ba	Fix physical drive grouping logic for unified pool visualization All checks were successful Build and Release / build-and-release (push) Successful in 2m11s Details Updated filesystem grouping to use extract_base_device method for proper partition-to-drive mapping. This ensures nvme0n1p1 and nvme0n1p2 are correctly grouped under nvme0n1 drive pool instead of separate pools.	2025-11-23 13:54:33 +01:00
Christoffer Martinsson	0e8b149718	Add partial filesystem data display for debugging All checks were successful Build and Release / build-and-release (push) Successful in 2m11s Details - Make filesystem display more forgiving - show partial data if available - Will display usage% even if GB values are missing, or vice versa - This should help identify which specific metrics aren't being populated - Debug version to identify filesystem data population issues	2025-11-23 13:33:36 +01:00
Christoffer Martinsson	2c27d0e1db	Prepare v0.1.107 for filesystem data debugging All checks were successful Build and Release / build-and-release (push) Successful in 1m31s Details Current status: Filesystem children appear with correct mount points but show —% —GB/—GB Need to debug why usage_percent, used_gb, total_gb metrics aren't populating filesystem entries	2025-11-23 13:24:13 +01:00
Christoffer Martinsson	9f18488752	Fix filesystem metric parsing for correct mount point names All checks were successful Build and Release / build-and-release (push) Successful in 2m10s Details - Fix extract_filesystem_metric() to handle multi-underscore metric names correctly - Parse known metric suffixes (usage_percent, mount_point, available_gb, etc.) - Prevent incorrect parsing like boot_mount_point -> fs_name='boot_mount', metric_type='point' - Should now correctly show /boot and / instead of /boot/mount and /root/mount	2025-11-23 13:11:05 +01:00
Christoffer Martinsson	fab6404cca	Fix filesystem children creation logic All checks were successful Build and Release / build-and-release (push) Successful in 1m17s Details - Allow filesystem entries to be created with any metric, not just mount_point - Ensure filesystem children appear under physical drive pools - Improve mount point fallback logic for better compatibility	2025-11-23 13:04:01 +01:00
Christoffer Martinsson	c3626cc362	Fix unified pool visualization filesystem children display issues All checks were successful Build and Release / build-and-release (push) Successful in 2m14s Details - Fix extract_pool_name() to handle filesystem metrics (_fs_) correctly - Prevent individual filesystem pools (nvme0n1_fs_boot, nvme0n1_fs_root) from being created - Fix incorrect mount point names (was showing /root/mount instead of /) - Only create filesystem entries when receiving mount_point metrics - Add available_gb field to FileSystem struct for proper available space handling - Ensure filesystem children show correct usage data instead of —% —GB/—GB	2025-11-23 12:58:16 +01:00
Christoffer Martinsson	d68ecfbc64	Complete unified pool visualization with filesystem children All checks were successful Build and Release / build-and-release (push) Successful in 2m17s Details - Implement filesystem children display under physical drive pools - Agent generates individual filesystem metrics for each mount point - Dashboard parses filesystem metrics and displays as tree children - Add filesystem usage, total, and available space metrics - Support target format: drive info + filesystem children hierarchy - Fix compilation warnings by properly using available_bytes calculation	2025-11-23 12:48:24 +01:00
Christoffer Martinsson	d1272a6c13	Implement unified pool visualization for single drives All checks were successful Build and Release / build-and-release (push) Successful in 1m19s Details - Group single disk filesystems by physical drive during auto-discovery - Create physical drive pools with filesystem children - Display temperature, wear, and health at drive level - Provide consistent hierarchical storage visualization - Fix borrow checker issues in create_physical_drive_pool method - Add PhysicalDrive case to all StoragePoolType match statements	2025-11-23 12:10:42 +01:00
Christoffer Martinsson	33b3beb342	Implement storage auto-discovery system All checks were successful Build and Release / build-and-release (push) Successful in 1m49s Details - Add automatic detection of mergerfs pools by parsing /proc/mounts - Implement smart heuristics for parity disk identification - Store discovered topology at agent startup for efficient monitoring - Eliminate need for manual storage pool configuration - Support zero-config storage visualization with backward compatibility - Clean up mount parsing and remove unused fields	2025-11-23 11:44:57 +01:00
Christoffer Martinsson	f9384d9df6	Implement enhanced storage pool visualization All checks were successful Build and Release / build-and-release (push) Successful in 2m34s Details - Add support for mergerfs pool grouping with data and parity disk separation - Implement pool health monitoring (healthy/degraded/critical status) - Create hierarchical tree view for multi-disk storage arrays - Add automatic pool type detection and member disk association - Maintain backward compatibility for single disk configurations - Support future extension for RAID and ZFS pool types	2025-11-23 11:18:21 +01:00
Christoffer Martinsson	156d707377	Add version display and fix status aggregation priorities All checks were successful Build and Release / build-and-release (push) Successful in 2m37s Details - Add dynamic version display in top bar using CARGO_PKG_VERSION - Rewrite status aggregation to only show Critical/Warning/OK in top bar - Fix Status enum ordering to prioritize OK over transitional states - Remove blue/gray colors from top bar background	2025-11-21 16:19:45 +01:00

1 2 3 4

192 Commits