The SmartGrid™ Control Suite provides a range of diagnostic features that allow operators and engineers to collect evidence, isolate causes, and validate corrective actions. These tools are essential in ensuring that issues are properly documented and resolved.
System Logs
Logs are the first resource when diagnosing problems. They capture both real-time events and historical information.
Log Categories:
Event Logs: Record alarms, warnings, and state changes.
Optimization Logs: Track algorithm decisions, such as peak shaving or load shedding.
Communication Logs: Record packet transmission, failures, and SCADA link status.
Safety Logs: Document E-stop activations, relay trips, and overcurrent events.
Retention Policy:
Default: 7 days, rolling archive.
Configurable via sg_config logrotate.
Always correlate logs with SCADA timestamps. This helps identify whether issues are internal to SmartGrid™ or external in the supervisory system.
Diagnostic Console
SmartGrid™ includes a built-in command-line interface (CLI) for on-site diagnostics.
Run sg_diag before and after major maintenance. This creates a baseline to compare against future issues.
Network & SCADA Diagnostics
Communication issues are often network-related. Use sg_netcheck to verify paths.
If packet loss >2%, check:
Ethernet cabling integrity.
Switch port configuration.
Firewall rules.
Log Analysis Best Practices
Correlate Events: Align multiple logs to reconstruct event sequences.
Look for Patterns: Repeated errors often indicate systemic faults (e.g., firmware bugs).
Distinguish Cause vs. Effect: A SCADA timeout may be an effect of a relay failure, not the root cause.
Automate Parsing: Use log parsing tools (grep, awk, or vendor-provided analyzers).
When escalating to support, compress and send the entire /var/log/smartgrid/ directory rather than selective entries. Context matters.