Software Engineer III, AI Infrastructure
Google.com
Office
Taipei, Taiwan
Full Time
Minimum Qualifications:
- Bachelor's degree in Electrical Engineering, Computer Science, or a related technical field, or equivalent practical experience.
- 2 years of experience in software development using languages such as C++.
Preferred Qualifications:
- Master's degree or PhD in Electrical Engineering, Computer Science or related technical fields.
- Experience working with cross-functional teams, including engineers, product managers, and technical program managers along with excellent communication skills.
- Knowledge of software for GPU/TPU architecture.
- Understanding of machine learning and AI, with knowledge of the underlying hardware and software stacks.
- Excellent problem-solving skills and ability to work in a fluid and collaborative environment.
About The Job
Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.
In this role, you will design and implement software for bootstrap and configure the Compute or Storage components, Operating System (OS) image deployment and node specific software as agents or offload computation, integrating with both Tensor Processing Unit (TPU) clusters and cloud infrastructure.
The AI and Infrastructure team is redefining what’s possible. We empower Google customers with breakthrough capabilities and insights by delivering AI and Infrastructure at unparalleled scale, efficiency, reliability and velocity. Our customers include Googlers, Google Cloud customers, and billions of Google users worldwide.We're the driving team behind Google's groundbreaking innovations, empowering the development of our cutting-edge AI models, delivering unparalleled computing power to global services, and providing the essential platforms that enable developers to build the future. From software to hardware our teams are shaping the future of world-leading hyperscale computing, with key teams working on the development of our TPUs, Vertex AI for Google Cloud, Google Global Networking, Data Center operations, systems research, and much more.
Responsibilities
- Partner with TPU Software (SW) and Hardware (HW) team and help design, develop and validate TPU based systems for deployment.
- Build and integrate Cloud Compute software to bootstrap TPU Artificial Intelligence (AI) Infrastructure.
- Drive timely delivery of projects within the team.
- Collaborate with cross-functional leaders to shape the future of our Machine Learning (ML) infrastructure.
- Provide technical guidance and mentorship, fostering the growth of team members.
