Standard Operations Procedures (SOP)

STANDARD OPERATING PROCEDURE FOR NETWORK INFRASTRUCTURE MANAGEMENT SUB UNIT

1. Purpose: This SOP establishes the processes and procedures for effective management and maintenance of the University’s network infrastructure, ensuring high availability, performance, security, and compliance with University policies and industry best practices. This SOP directly supports the strategic goals of the NOC under COMSIT Directorate.

2. Scope: This SOP encompasses all aspects of network infrastructure management, including:

  1. Network Planning and Design: Future-proofing the network, including capacity planning, technology selection, and architectural design for scalability and resilience.
  2. Implementation and Deployment: Physical installation, configuration, testing, and documentation of all network devices and infrastructure.
  3. Maintenance and Operations: Proactive and reactive maintenance, including troubleshooting, upgrades, and performance monitoring.
  4. Documentation and Inventory: Maintaining accurate and up-to-date network documentation, diagrams, and inventory records.
  5. Security: Implementing and enforcing network security policies and procedures.
  6. Vendor Management: Managing relationships with network vendors and service providers.

3. Roles and Responsibilities: (Note: The number of personnel in each role will be determined by the size and complexity of the network infrastructure.)

  1. Network Infrastructure Management Lead: Overall responsibility for the network infrastructure management sub-unit. Develops and implements strategic plans, manages staff, and reports to the Network Administrator.
  2. Network Architect: Designs, implements, and maintains the overall network architecture. Develops long-term plans for expansion and upgrades, ensuring interoperability and compliance with industry standards.
  3. Senior Network Engineers: Responsible for design, implementation, and maintenance of complex network segments (e.g., fiber optics, core routing). Provide advanced troubleshooting and technical guidance to junior engineers.
  4. Network Engineers: Implement and maintain network infrastructure according to designs. Perform routine maintenance, troubleshooting, and performance monitoring.
  5. Network Technicians: Install and maintain physical network infrastructure (cabling, equipment). Perform basic troubleshooting and maintenance; assist engineers with larger projects.
  6. Data Center Technician: Responsible for the physical maintenance of the server rooms and network closets, including power and cooling systems.

4. Line of Command and Reporting:

  1. All Network Infrastructure Management staff report to the Network Infrastructure Management Lead.
  2. The Network Infrastructure Management Lead reports to the Network Administrator.
  3. The Network Administrator reports to the Head of the NOC.
  4. The Head of the NOC reports to the Director, COMSIT.

5. Processes and Procedures:

  1. Change Management: All changes to the network infrastructure (hardware, software, configuration) must follow a change management procedure.
  2. Incident Management: A well-defined incident management process is in place, including clear escalation paths, communication protocols, and reporting procedures.
  3. Problem Management: Proactive identification and resolution of recurring network problems to prevent future incidents.
  4. Preventive Maintenance Schedule: A detailed preventative maintenance schedule is developed and implemented, covering all network devices and infrastructure.
  5. Capacity Planning: Regular capacity planning should be conducted to ensure continuous service improvement for current and future demands.
  6. Network Security Policy: University network security policy and  infrastructure management activities must adhere to University network security policies and procedures. Regular network security audits should be conducted.
  7. Vendor Management: Relationships with network vendors are managed through contracts, service level agreements (SLAs), and regular communication.
  8. Documentation: Comprehensive network documentation is maintained, including diagrams, configurations, inventory records, and standard operating procedures.

6. Escalation Matrix:

Issue TypeLevel 1 (Technician)Level 2 (Engineer)Level 3 (Lead/Architect)
Minor network issuesTroubleshoot and resolveEscalate if unresolvedReview resolution
Major network outagesEscalate immediatelyTroubleshoot and escalateCoordinate resolution
Security incidentsEscalate immediatelyInvestigate and escalateManage response
Capacity issuesReport to LeadAnalyze and recommendPlan and implement

7. Performance Measurement: KPIs (Key Performance Indicators) are used to monitor the effectiveness of the Network Infrastructure Management sub-unit:

  1. Network Uptime (Fiber & Wireless): Targets should be set based on industry best practices and service level agreements (e.g., 99.99% for fiber)over a period of time.
  2. Mean Time To Repair (MTTR): Defined targets for resolving network outages based on severity (e.g., under 1 hour for critical outages).
  3. Successful Project Completion Rate: Track percentage of network projects completed on time and within budget.
  4. Documentation Accuracy: Maintain 100% accuracy of network documentation.
  5. Security Incident Response Time: Targets for resolving security incidents based on severity.

7. Review and Updates: This SOP must have review due date(RDD) and should have strictly adhered to,so as to reflect technological advancements, changes to the network infrastructure, and evolving organizational requirements. 

STANDARD OPERATING PROCEDURE (SOP) FOR NETWORK SECURITY AND MONITORING SUB UNIT

1. Purpose: This SOP defines the processes and procedures for securing and monitoring the University’s network infrastructure to ensure its confidentiality, integrity, and availability. It supports the overall strategic goals of the NOC and COMSIT.

2. Scope: This SOP covers all aspects of network security and monitoring, including:

  1. Security Policy Enforcement: Implementing and enforcing network security policies and procedures.
  2. Threat Detection and Prevention: Deploying and managing security tools to detect and prevent security threats.
  3. Security Monitoring: Continuous monitoring of network activity for suspicious behavior.
  4. Incident Response: Handling security incidents according to established procedures.
  5. Vulnerability Management: Identifying and remediating network vulnerabilities.
  6. Security Awareness Training: Educating users about network security best practices.
  7. Security Auditing: Regularly auditing network security controls.

3. Roles and Responsibilities:

  1. Network Security and Monitoring Lead: Overall responsibility for the Network Security and Monitoring sub-unit. Manages staff, develops and implements security policies, and reports to the Network Administrator.
  2. Security Engineer: Designs, implements, and maintains network security infrastructure; develops and enforces security policies and procedures. Responds to major security incidents.
  3. Security Analyst: Monitors network traffic and security logs; analyzes security events and alerts; investigates security incidents; creates reports.
  4. Network Monitoring Specialist: Monitors network performance using NMS (Network Management System); sets up and configures monitoring tools; analyzes data and generates reports.

4. Line of Command and Reporting:

  1. All Network Security and Monitoring staff report to the Network Security and Monitoring Lead.
  2. The Network Security and Monitoring Lead reports to the Network Administrator.
  3. The Network Administrator reports to the Head of the NOC.
  4. The Head of the NOC reports to the Director, COMSIT.

5. Processes and Procedures:

  1. Security Policy Enforcement: Security policies are clearly defined, communicated, and enforced. Regular training and awareness programs are conducted for all users.
  2. Threat Detection and Prevention: Firewalls, intrusion detection/prevention systems (IDS/IPS), and other security tools are deployed and managed effectively. Regular vulnerability scans are performed.
  3. Security Monitoring: Continuous monitoring of network traffic, security logs, and system events is conducted using SIEM (Security Information and Event Management) and other monitoring tools. Alerts are reviewed and investigated promptly.
  4. Incident Response: A documented incident response plan is in place and followed for all security incidents. This plan includes escalation procedures, communication protocols, and post-incident analysis.
  5. Vulnerability Management: A vulnerability management program is implemented to identify, assess, and remediate network vulnerabilities.
  6. Security Auditing: Regular security audits are conducted to verify the effectiveness of security controls.
  7. Compliance: The Network Security and Monitoring team ensures compliance with all relevant University policies, industry standards (e.g., NIST Cybersecurity Framework), and legal regulations.

6. Escalation Matrix:

Issue TypeLevel 1 (Analyst)Level 2 (Engineer)Level 3 (Lead/COMSIT Security Team)
Suspicious network activityInvestigate and analyzeEscalate if malicious activityFull incident response investigation
Security alertsInvestigate and triageEscalate if criticalFull incident response investigation
Security incidents (minor)Respond and remediateEscalate if unresolvedFull incident response investigation
Security incidents (major)Escalate immediatelyCoordinate responseFull incident response investigation
System compromise suspectedEscalate immediatelyCoordinate responseFull incident response investigation

7. Performance Measurement: KPIs (Key Performance Indicators) are used to monitor the effectiveness of the Network Security and Monitoring sub-unit:

  1. Security Incident Response Time: Defined targets for responding to and resolving security incidents based on severity.
  2. Mean Time To Detection (MTTD): Track the average time it takes to detect security incidents.
  3. Mean Time To Resolution (MTTR): Track the average time it takes to resolve security incidents.
  4. Number of Security Vulnerabilities: Track the number of vulnerabilities identified and remediated.
  5. Network Uptime due to Security: Track network downtime caused by security incidents.
  6. Accuracy of Security Audits: Ensure 100% accuracy of security audit reports.
  7. False Positive Rate: Minimize false positives from security monitoring systems.

STANDARD OPERATING PROCEDURE (SOP)

1. Purpose: This SOP defines the processes and procedures for managing and maintaining the University’s servers and data, ensuring high availability, performance, security, and data integrity. It directly supports the strategic goals of the NOC and COMSIT.

2. Scope: This SOP covers all aspects of server and data management, including:

  1. Server provisioning and Management: Managing the lifecycle of servers, including procurement, installation, configuration, maintenance, and decommissioning.
  2. Data Management: Implementing and maintaining data management policies and procedures, including backup and recovery, data storage, and access control.
  3. Database Administration: Managing and maintaining databases, including performance tuning, security, and backups.
  4. System Monitoring: Continuous monitoring of server health and performance.
  5. Incident Response: Handling server and data-related incidents according to established procedures.
  6. Capacity Planning: Ensuring sufficient server and storage capacity to meet current and future needs.
  7. Security: Implementing and maintaining appropriate security measures for servers and data.
  8. Roles and Responsibilities:
  9. Server and Data Management Lead: Overall responsibility for the Server and Data Management sub-unit. Develops and implements strategies, manages staff, and reports to the Network Administrator.
  10. System Administrators: Manage and maintain servers, operating systems, applications, and databases. Perform regular maintenance tasks, implement security measures, and resolve server-related incidents.
  11. Database Administrators: Manage and maintain databases, ensuring data integrity and availability. Perform regular backups and implement recovery procedures. Optimize database performance and security.
  12. Data Center Technician: Responsible for the physical maintenance of the server room, including power, cooling, and environmental monitoring.

4. Line of Command and Reporting:

  1. All Server and Data Management staff report to the Server and Data Management Lead.
  2. The Server and Data Management Lead reports to the Network Administrator.
  3. The Network Administrator reports to the Head of the NOC.
  4. The Head of the NOC reports to the Director, COMSIT.

5. Processes and Procedures:

  1. Server Provisioning: A standardized process for procuring, installing, configuring, and deploying servers is followed. This includes documentation of all configurations and settings.
  2. Server Maintenance: Regular preventative maintenance tasks are scheduled and performed, including patching, software updates, and performance monitoring.
  3. Data Backup and Recovery: A robust backup and recovery plan is implemented, including regular backups, testing of backups, and disaster recovery procedures.
  4. Data Storage: Data storage policies and procedures are established and followed, including data retention policies and archiving strategies.
  5. Access Control: Appropriate access control measures are implemented to protect server and data security.
  6. Database Administration: Database performance is monitored and optimized. Regular database backups are performed. Security measures are implemented to protect database integrity and availability.
  7. System Monitoring: Server health and performance are continuously monitored using appropriate tools. Alerts are reviewed and investigated promptly.
  8. Incident Response: A documented incident response plan is in place for handling server and data-related incidents. This plan includes escalation procedures, communication protocols, and post-incident analysis.
  9. Capacity Planning: Regular capacity planning is conducted to ensure sufficient server and storage capacity to meet current and future needs.
  10. Security: Security measures are implemented to protect server and data security, including access control, encryption, and regular security audits.

6. Escalation Matrix:

Issue TypeLevel 1 (System Admin)Level 2 (DBA/Lead)Level 3 (NOC Management/COMSIT)
Minor server issuesTroubleshoot and resolveEscalate if unresolvedEscalate only if critical impact
Major server outagesEscalate immediatelyImmediate troubleshootingIncident Management Team
Database performance issuesTroubleshoot and optimizeEscalate if unresolvedDatabase Administrator Team
Data loss or corruptionEscalate immediatelyInitiate data recoveryIncident Management Team
Security incidentsEscalate immediatelyCoordinate responseIncident Management Team

7. Performance Measurement: KPIs (Key Performance Indicators) are used to monitor the effectiveness of the Server and Data Management sub-unit:

  1. Server Uptime: Target should be set based on industry best practices and SLAs (e.g., 99.99%).
  2. Database Uptime: Target for database availability (e.g., 99.9%).
  3. Backup and Recovery Success Rate: Track the success rate of backups and recoveries.
  4. Mean Time To Repair (MTTR): Defined targets for resolving server outages based on severity.
  5. Mean Time To Recovery (MTTR): Defined targets for recovering from data loss or corruption incidents.
  6. Data Loss Rate: Track instances of data loss due to server failures or other incidents. Aim for 0%.
  7. Compliance with Data Backup and Retention Policies: Ensure 100% adherence to policies.

8. Review and Updates: This SOP will be reviewed and updated at least annually, or more frequently as needed, to reflect technological advancements, changes to the server and data infrastructure, and evolving organizational requirements. Regular reviews with stakeholders should be incorporated.

STANDARD OPERATING PROCEDURE (SOP) FOR HELP DESK SUPPORT

1. Purpose: This SOP outlines the processes and procedures for providing efficient and effective technical support to University of Ilorin users. It aims to ensure high user satisfaction and aligns with the overall strategic goals of the NOC and COMSIT.

2. Scope: This SOP covers all aspects of Help Desk support, including:

  1. Incident Management: Handling and resolving user-reported IT incidents.
  2. Request Fulfillment: Processing and fulfilling user requests for IT services.
  3. Knowledge Management: Creating and maintaining a knowledge base of solutions to common IT issues.
  4. Communication: Communicating effectively with users regarding IT support issues.
  5. Escalation: Escalating complex issues to higher-level support teams.
  6. Performance Monitoring: Tracking key metrics to measure Help Desk performance.

3. Roles and Responsibilities:

  1. Help Desk Manager: Overall responsibility for the Help Desk. Manages staff, develops and implements support procedures, and reports to the Network Administrator.
  2. Help Desk Analysts (Tier 1): Provide first-level support to end-users, troubleshooting common issues, creating and managing tickets within the ITSM (IT Service Management) system, and escalating complex issues to Tier 2 support.
  3. Second-Level Support (Tier 2): Provide in-depth troubleshooting and technical support for complex issues. Collaborate with other IT units as needed.

4. Line of Command and Reporting:

  1. All Help Desk staff report to the Help Desk Manager.
  2. The Help Desk Manager reports to the Network Administrator.
  3. The Network Administrator reports to the Head of the NOC.
  4. The Head of the NOC reports to the Director, COMSIT.

5. Processes and Procedures:

  1. Incident Logging and Tracking: All incidents are logged in the ITSM system, including detailed descriptions, user information, and priority levels. Tickets are assigned and tracked throughout the resolution process.
  2. Incident Resolution: Help Desk Analysts follow established troubleshooting procedures to resolve incidents. Documentation of the resolution process is required for all incidents.
  3. Request Fulfillment: User requests for IT services (e.g., account creation, software installation) are processed and fulfilled according to established procedures.
  4. Knowledge Management: A knowledge base is maintained containing solutions to frequently encountered IT issues. Help Desk Analysts are encouraged to contribute to the knowledge base.
  5. Communication: Effective communication with users is essential. Help Desk Analysts should provide regular updates on incident status and clearly explain technical issues in non-technical terms.
  6. Escalation: Complex or unresolved incidents are escalated to Tier 2 support or other specialized teams according to the escalation matrix.
  7. Performance Monitoring: Key performance indicators (KPIs) are tracked and analyzed to measure Help Desk performance, such as average resolution time, customer satisfaction, and first-call resolution rate. Regular reports are generated.

6. Escalation Matrix:

Issue TypeTier 1 (Analyst)Tier 2 (Second-Level Support)Level 3 (Specialized Team/NOC Management)
Simple software issuesResolve directlyEscalate if unresolvedNot usually required
Basic hardware problemsResolve directly or escalateResolve or escalate to vendorEscalate only if critical impact
Network connectivity issuesEscalate to Network teamFurther investigationNetwork Infrastructure Team
Account access issuesResolve directlyEscalate if unresolvedIdentity and Access Management Team
Complex technical issuesEscalate immediatelyInvestigate and resolveRelevant specialized team or NOC Management
Security incidentsEscalate immediatelyInvestigate and escalateSecurity Team

7. Performance Measurement: KPIs (Key Performance Indicators) are used to monitor the effectiveness of the Help Desk:

  1. Average Resolution Time: Target for resolving incidents (e.g., under 2 hours for high priority, under 4 hours for normal priority).
  2. Customer Satisfaction: Measured through surveys or feedback mechanisms. Target should be 90% or higher.
  3. First Call Resolution Rate: Percentage of incidents resolved on the first contact. Target should be 70% or higher.
  4. Ticket Resolution Accuracy: Percentage of tickets resolved correctly on the first attempt. Target should be above 95%.
  5. Number of Escalated Tickets: Track percentage of tickets escalated to higher levels. Aim for a low percentage.

8. Review and Updates: This SOP will be reviewed and updated at least annually, or more frequently as needed, to reflect changes in technology, user needs, and organizational requirements. Regular reviews with stakeholders should be incorporated.

STANDARD OPERATING PROCEDURE (SOP) FOR POWER AND COOLING ADMINISTRATION SUB UNIT

1. Purpose: This SOP defines the processes and procedures for managing and maintaining the power and cooling infrastructure of the University of Ilorin NOC, ensuring reliable and stable operation of IT equipment. This directly supports the overall strategic goals of the NOC and COMSIT.

2. Scope: This SOP covers all aspects of power and cooling administration, including:

  1. Power Infrastructure Management: Maintaining the primary and backup power systems, including generators, UPS (Uninterruptible Power Supply) systems, and power distribution.
  2. Cooling Infrastructure Management: Maintaining the HVAC (Heating, Ventilation, and Air Conditioning) systems and other cooling equipment to regulate temperature and humidity within the NOC.
  3. Environmental Monitoring: Continuous monitoring of environmental conditions (temperature, humidity, airflow) within the NOC.
  4. Preventive Maintenance: Implementing and adhering to a preventative maintenance schedule for all power and cooling equipment.
  5. Incident Response: Handling power and cooling-related incidents according to established procedures.

3. Roles and Responsibilities:

  1. Power and Cooling Administration Lead: Overall responsibility for the Power and Cooling Administration sub-unit. Develops and implements strategies, manages staff, and reports to the Network Administrator.
  2. Power and Cooling Engineer: Designs, implements, and maintains the NOC’s power and cooling systems; develops and implements preventative maintenance plans; responds to power and cooling system alerts; performs troubleshooting; monitors environmental conditions.
  3. Power and Cooling Technician: Performs routine maintenance on power and cooling equipment; assists the engineer with more complex tasks.

4. Line of Command and Reporting:

  1. All Power and Cooling Administration staff report to the Power and Cooling Administration Lead.
  2. The Power and Cooling Administration Lead reports to the Network Administrator.
  3. The Network Administrator reports to the Head of the NOC.
  4. The Head of the NOC reports to the Director, COMSIT.

5. Processes and Procedures:

  1. Power System Maintenance: Regular preventative maintenance is performed on all power equipment, including generators, UPS systems, and power distribution. This includes regular testing, inspections, and necessary repairs.
  2. Cooling System Maintenance: Regular preventative maintenance is performed on all cooling equipment, including HVAC systems and other cooling devices. This includes regular testing, inspections, and necessary repairs.
  3. Environmental Monitoring: Environmental conditions (temperature, humidity, airflow) within the NOC are continuously monitored using sensors and monitoring systems. Alerts are triggered if conditions exceed predefined thresholds.
  4. Preventative Maintenance Schedule: A detailed preventative maintenance schedule is developed and implemented for all power and cooling equipment. This schedule includes specific tasks, frequencies, and responsible personnel.
  5. Incident Response: A documented incident response plan is in place for handling power and cooling related incidents. This plan includes escalation procedures, communication protocols, and post-incident analysis.
  6. Emergency Procedures: Procedures are in place for handling power outages and other emergencies. This includes backup power procedures and emergency contact information.
  7. Documentation: Comprehensive documentation is maintained for all power and cooling equipment, including specifications, maintenance records, and diagrams.

6. Escalation Matrix:

Issue TypeLevel 1 (Technician)Level 2 (Engineer)Level 3 (NOC Management/Facilities)
Minor power/cooling issuesTroubleshoot and resolveEscalate if unresolvedEscalate only if critical impact
Major power outagesEscalate immediatelyImmediate troubleshootingFacilities Management/Emergency Services
HVAC system malfunctionsEscalate immediatelyImmediate troubleshootingFacilities Management
Environmental condition alertsInvestigate and addressEscalate if unresolvedFacilities Management
Generator failureEscalate immediatelyImmediate troubleshootingFacilities Management/Emergency Services

7. Performance Measurement: KPIs (Key Performance Indicators) are used to monitor the effectiveness of the Power and Cooling Administration sub-unit:

  1. NOC Uptime due to Reliable Power: Track the percentage of NOC downtime caused by power outages. Aim for 0%.
  2. UPS System Availability: Track the percentage of time the UPS system provides uninterrupted power. Aim for 100%.
  3. HVAC System Uptime: Track the percentage of time the HVAC system maintains optimal environmental conditions. Aim for 99% or higher.
  4. Mean Time To Repair (MTTR) for Power/Cooling Outages: Track the average time taken to resolve power or cooling system outages. Define targets based on severity.
  5. Adherence to Preventive Maintenance Schedule: Track the percentage of preventative maintenance tasks completed according to schedule. Aim for 100%.
  1. DOCUMENT CONTROL
  2. Version: 
  3. Effective Date: 
  4. Review Due Date: 
  5. Prepared by: 
  6. Approved by: 
  7. Distribution:
  8. Head of NOC Unit
  9. Core Mandate Leads (Network Infrastructure Management, Network Security & Monitoring, Server & Data Management, Help Desk & Support, Power & Cooling Administration)
  10. Relevant Technical Staff within each Sub-unit
  11. Network Administrator
  12. Deputy Directors, COMSIT
  13. Director, COMSIT
  14. [Add any other relevant stakeholders as needed]

7. Revision History: