
SAS Date Engineer- Linux, bilingual in Korean
Cesna Recruitment
Posted about 17 hours ago
[SAS Datalake Engineer]
1. Node Replacement and Upgrades:
- Assess Isilon and NetApp cluster capacity and performance metrics using OneFS and ONTAP to determine the need for node replacements or upgrades.
- Manage the integration of new Isilon and NetApp nodes, ensuring system compatibility and minimal downtime during scalability operations.
- Oversee the upgrade and replacement of HPC, Kubernetes, Greenplum, Impala cluster servers, and GPU server hardware to enhance computational and graphics processing capabilities.
2. Troubleshooting and Problem Resolution:
- Utilize Impala Insight IQ and NetApp tools for in-depth analysis to diagnose and resolve issues related to throughput, latency, and capacity.
- Leverage One FS and ONTAP troubleshooting tools and logs to identify and rectify system errors and hardware malfunctions.
- Address performance bottlenecks and system failures in HPC, Kubernetes, Greenplum, GPU servers, Impala, and NetApp environments to ensure operational efficiency.
3. System Monitoring and Maintenance:
- Implement continuous monitoring with Isilon One FS event logs, NetApp ONTAP tools, SNMP alerts, and Grafana to detect anomalies early.
- Schedule regular maintenance using Isilon's and NetApp's software suites to ensure optimal performance and longevity of the storage systems.
- Monitor and maintain HPC, Kubernetes, Greenplum, GPU, Impala, and NetApp server environments, applying necessary updates and performing system health checks regularly.
4. Patches and Updates:
- Apply Impala and NetApp-specific patches and firmware updates to nodes and system software, ensuring all components are secure and up-to-date.
- Test new software updates in a sandbox environment to evaluate their impact on performance and stability before deployment across the cluster.
- Manage and coordinate software updates and patch installations across HPC, Kubernetes, Greenplum, GPU servers, Impala, and NetApp systems to maintain software integrity and security.
5. Technical Support and Consultation:
- Provide specialized support and consultation for optimizing storage solutions with Isilon and NetApp, including data migration, system expansion, and configuration tuning.
- Offer technical support for optimizing HPC, Kubernetes, Greenplum, and GPU server configurations, improving computational resources and data processing workflows.
- Assist with the design, configuration, and optimization of NetApp storage architectures to ensure efficient data management.
6. End-of-Life (EOL) Hardware Management:
- Manage the phased decommissioning of aging hardware in Isilon and NetApp systems, ensuring compliance with environmental standards and security protocols.
- Coordinate the replacement of EOL hardware for HPC, Kubernetes, Greenplum, GPU servers, Isilon, and NetApp to prevent performance degradation.
7. Integration and System Compatibility:
- Ensure that Impala, NetApp, HPC, Kubernetes, Greenplum, and GPU storage solutions are fully integrated and compatible with the overall IT infrastructure.
- Develop and implement strategies for the effective integration of storage and processing technologies to maximize performance and resource utilization.
8. Documentation and Knowledge Sharing:
- Maintain detailed documentation of system configurations, upgrades, and troubleshooting activities to streamline processes and support team onboarding.
- Conduct training sessions or create guides for internal teams to share knowledge about system operations and best practices.
9. Disaster Recovery and Backup:
- Plan and test disaster recovery procedures for Impala, NetApp, and HPC/Kubernetes/Greenplum/GPU servers to ensure data availability and business continuity.
- Ensure backup and recovery processes are in place and regularly tested to minimize data loss risks.
10. Capacity Planning:
- Develop capacity planning strategies to proactively scale storage and compute resources based on performance trends and business needs.
11. Automation:
- Develop and maintain automation scripts for routine tasks such as monitoring, data migration, or maintenance to improve system efficiency.
- Utilize automation tools to streamline repetitive tasks, enhancing system performance and reducing manual errors.
12. Security and Compliance:
- Ensure compliance with organizational security standards and relevant regulations during all maintenance, upgrades, and troubleshooting activities.
- Implement or coordinate with security teams on applying best practices and hardening measures for Isilon, NetApp, HPC, Kubernetes, Greenplum, and GPU servers.
13. Data Center Work:
- Participate in data center activities, including racking and initial installation of servers and storage equipment.
- Ensure proper cabling, power connections, and network configurations during initial setup to facilitate smooth integration into the IT environment.
Job details
Jobr Assistant extension
Get the extension →