Responsibilities
- Be able to define business requirements and systems goals, planning to change, update for system infrastructure both long-term fix and hot-fix plans
- Gauge the effectiveness and efficiency of existing systems; develop and implement strategies for improving or further leveraging these systems.
- Design and perform server and security audits, system backup procedures, and other recovery processes in accordance with the company’s disaster recovery and business continuity strategies.
- Ensure system connectivity of all servers, shared software, groupware, and other applications
- Create and maintain documentation as it relates to system configuration, mapping, processes, and service records.
- Ensure compatibility and interoperability of in-house computing systems
- Monitor and test system performance; prepare and deliver system performance statistics and reports.
- Cooperate with DevOps and Software, NOC team for incident handling.
Requirements
- Proven experience in overseeing the design, development, and implementation of software systems, applications, and related products.
- Deep knowledge about Linux system IS A MUST
- Experienced with monitoring tools (Zabbix, Nagios, Grafana, Prometheus)
- Experienced with web server: Nginx, Apache, Tomcat …
- Experienced with proxy: Haproxy, Nginx …
- Excellent written, oral, and interpersonal communication skills
- Ability to conduct research into systems issues and products as required
- Ability to communicate ideas in both technical and user-friendly language
- Highly self-motivated and directed, with keen attention to detail
- Proven analytical and creative problem-solving abilities
- Able to prioritize and execute tasks in a high-pressure environment/wastewater
- Strong customer service orientation
- Ability to work in a team-oriented, collaborative environment/wastewater
- Knowledge of networking concepts (DNS, TCP/IP, HTTP/HTTPS and firewalls, LDAP)
- Experienced with high availability system
- Knowledge Database: MongoDB, PostgreSQL … and database model (replicaset, cluster)
- Experienced source code manager: Gitlab
- Willing to learn and research new technologies
* Plus:
- Experienced with Docker container: Docker-composer, Swarm, Kubernetes.
- Experienced Build and Management big system
- Experienced build and management log tool: ELK, Graylog, Splunk
JOB FEATURES
Job Category: Infrastructure System, IT, IT and Software Development, System Engineer