This document proposes a collaboration between IBM and a university to establish a Center of Excellence for AI, HPC, and cloud computing. It outlines hardware specifications including an IBM AC922 system with Power9 processors, GPUs, and storage. It also discusses offering Red Hat Enterprise Linux courseware, IBM Watson Machine Learning, opportunities for faculty and students through projects, publications, and skills development. The collaboration aims to advance research and provide training through specialized courses over a two-year period.
2. 2
Proposed AI setup and specifications - Hardware
System and its Configuration
College Ethernet Network
4
4
College Ethernet Network
1 IBM AC922 System
2 POWER9 Processors ( 40 Cores )
128 GB Memory
2 TB Hard drive
40 Cores Power 9 Processor
2 NVLINK-2 nVidia GPUs
Edge Compute devices
1 RACK ( Which can fit in )
5. 2 Year Developmental Timeline
a) IBM POWER Academic Initiative
partnership
b) OpenPOWER system and
Accelerator for Deep Learning
and Machine Learning
c) Technical Projects deployment
d) Review of progress in technical
projects, lab coursework
e) Big data and AI curriculums
6. 6
Red Hat Enterprise Linux Courseware As part of Academia
partnership
1. Red Hat System Administration I -- Relates to Red Hat Enterprise Linux 7, and is designed for IT
professionals without previous Linux administration experience. The course focuses on providing
students with Linux administration "survival skills" by focusing on core administration tasks.
2. Red Hat System Administration II -- Specifically designed for students who have completed Red
Hat System Administration I, and focuses on key tasks needed to become a full-time Linux®
administrator.
3. Red Hat System Administration III -- Related to Red Hat Enterprise Linux, 7, and designed for
experienced Linux system administrators who hold a Red Hat Certified System Administrator (RHCSA)
certification or equivalent skills and who want to broaden their ability to administer Linux systems at an
enterprise level.
4. Red Hat JBoss Enterprise Application Development I -- Learn how to create, test, and maintain
Java EE-compliant applications from start to finish using the Eclipse-based Red Hat JBoss Developer
Studio.
5. Red Hat OpenStack Administration I -- Designed for system administrators who are intending to
implement a cloud computing environment using Red Hat OpenStack Platform. This course is based on
Red Hat OpenStack Platform 8.
7. IBM Software
Offerings along
with the Server
Software Overview
§ IBM’s hardware offerings for HPC are enhanced when
combined with enterprise class software solutions. These
include Red
§ Hat Enterprise Linux (RHEL), IBM Watson Machine Learning,
and IBM Spectrum Computing.
Red Hat
§ The proposed solution includes Red Hat Enterprise Linux 7
(RHEL) with 5-year support on all compute and storage
nodes, RHEL and CentOS are highly compatible Linux
operating systems. Although support is available for both
operating systems on the IBM Power Systems AC922 server,
running RHEL on IBM Power provides clients with enterprise
grade Linux support.
§ Red Hat is a leading provider of open-source solutions, and
IBM is one of the largest Linux contributors. RHEL 8 for
Power exploits the latest IBM POWER and virtualization
technologies to help maximize system resources and provide
high qualities of service to your end users. RHEL 7 enables
the following functions on POWER:
§ Simultaneous multithreading
§ Static micro threading
§ Transactional memory
7
8. IBM Software
Offerings along
with the Servers
§ IBM Watson Machine Learning CE) are available at
no charge.
§ IBM Watson Machine Learning (formerly IBM
PowerAI)
§ IBM Watson Machine Learning makes deep learning
and machine learning more accessible to your staff,
and the benefits of
§ AI more obtainable to the University. It combines
popular open source deep learning frameworks,
efficient Artificial Intelligence development tools, and
accelerated IBM. Power Systems™ servers. With
IBM Watson Machine Learning, the University can
deploy a fully optimized and supported AI platform
that delivers blazing performance, along with proven
dependability and resilience.
8
9. Advantages for Your Faculty and
Students
§ Talent and Skills: (Remote Interns; Skills and Training)
Students and Research scholars will start working on the
advanced technologies will enable them to work on
many applications
Publications and Mindshare: (Press releases, Articles,
and Publications; Conferences and Events)
1. Conference Paper on software-based application
research /development in 6 months
§ Intellectual Capital: (Patents, Open source; Prototypes,
Demos; Curriculum; Student projects, Theses)
1. Prototype building of many research problems using
software-centric approach (hardware-centric baseline
implementation almost getting completed)
2. Potential to file disclosures
§ Opportunities: (Seed revenue; Leverage other funding;
Build ecosystems; Build government/client relationships)
1. Once software-centric solution available with
comparable performance using latest technologies ,
your team would create prototypes which can be
demonstrated to several colleges
10. Special Courses along with COE
§ Big Data with docker and Kubernetes
§ Machine Learning with Python
§ Data Science Course
§ Exascale Computing Infra
§ Quantum Computing Workshop
§ Faculty Development programs 2
More than 100 hours of technology workshops
11. Processor Core Enablement and
Partnership
1. Introduction of open-ended experiments on A2I
Core in the FPGA Lab curriculum
2. Allotment of Mini Projects to students on
HDL/Verilog/ A2I Core
3. Global Remote Mentoring for students with our
mentors, who have desired FPGA coding skills
4. FDP for faculty on porting & integration of modules
for application design using A2I core
5. Discussion on the creation of data-path for the
development of softcore processor architecture
6. Joint research activities
7. Development of specific solutions for IBM as
sponsored projects / consultancy
8. Sharing of learning materials for A2I core and
relevant tool chain
11
12. Onstitute Platform and Wisconsin
Collaboration-Platteville
§ By registering to Onstitute, the students can get the
following benefits:
§ Learn a broad range of data science topics (e.g., big
data analytics, cloud computing, machine learning, deep
learning, etc.) and analytic software tools.
§ Get access to cutting edge hardware infrastructure
(including supercomputing-level systems with multicore
CPU, multiple GPU, etc.) while learning.
§ Exposure multiple job opportunities in data-science and
related field.
§ Involve in real-world big data and AI projects together
with academia and industry leaders.
§ Opportunities to participate in world-class
workshops/webinars and rewarding hackathons.
12
13. University of Oregon , E4S and TAU
Collaborations
E4S or the Extreme-scale Scientific Software Stack [https://e4s.io] is a community effort to provide open
source software packages for developing, deploying and running scientific applications on high-
performance computing (HPC) platforms. E4S provides from-source builds and containers of a broad
collection of HPC software packages. E4S exists to accelerate the development, deployment and use of
HPC software, lowering the barriers for HPC users.
§ "TAU Performance System® [http://tau.uoregon.edu] available on
OpenPOWER:
– Profiling and tracing support with 3D profile browsers
– Support for IBM XL, GNU, and LLVM Clang compilers
– Support for PowerAI, Spectrum MPI, and MVAPICH2 GDR, CUDA,
OpenACC
– Multi-platform support in TAU
• IBM Power, Cray XC, ARM64, x86_64, NVIDIA CUPTI and AMD
GPUs (ROCm)
13