Amazon AWS
- Amazon AWS
- Getting Started
- Architecture
- Research (Services/Applications)
- Machine Learning on AWS
- HPC on AWS
- Container Services
Getting Started
- Getting Started with AWS
- Overview: Free online training paths for researchers and research IT
- AWS Immersion Day tutorials
- AWS Cloud Practitioner Essentials Training Course
- Navigating the Console
- Overview of Services
- Cost optimization/cost monitoring
- Control your costs tutorial
- IAM
Architecture
VPC
Compute
- Getting Started with EC2
- EC2 Pricing
- EC2 Instance Types
Storage
- Overview of different kinds of storage options
- Storage Overview Training Course
- S3: Object Storage
- S3 Object Storage Training Course
- EBS: Disk Storage
- EBS Deep Dive Training Course
- S3 Glacier: Archive Storage
Research (Services/Applications)
General Information
- Open Data Registry on AWS
- Amazon Science
- Self-Paced/Free Digital Training Courses
- Curated List of AWS Research Training
Machine Learning on AWS
Learn about end-to-end Machine Learning resources in AWS like Amazon SageMaker, leverage AI Services like Amazon Comprehend for sentiment analysis, Amazon Transcribe for speech to text, or Translate to provide language translation without needing the knowledge of building ML models.
General Information
- Landing Page
- SageMaker Documentation
- AWS Training & Certification Free Digital Training
- AWS Machine Learning Research Awards
- Coursera Getting Started with AWS Machine Learning
- AWSLabs Machine Learning Samples
- Augmented AI Example UIs
- Group Truth data labeling UIs
Workshops
- Distributed Training with SageMaker and Horovod
- Distributed Training with EKS and Horovod
- FHIR Integration with Amazon Comprehend Medical
- Everything DeepRacer
- SpaceNet on SageMaker
- ETL Pipelines for SageMaker
- Amazon SageMaker Heart Disease Prediction
- MLOps with Amazon SageMaker
- Object Detection from scratch with Amazon SageMaker
- Elastic Inference Object Detection with Amazon SageMaker
Blogs
Research Computing
SageMaker
- AWS DataExchange and Amazon SageMaker for sharing data for ML workloads
- Semantic segmentation labeling with Amazon SageMaker Ground Truth
- Amazon SageMaker multi-model inference endpoints
- Batch Inference with Amazon SageMaker and Tensorflow
- Optimizing TensorFlow model serving with Kubernetes and Amazon Elastic Inference
- Power contextual bandits using continual learning with Amazon SageMaker RL
- Speed up training on Amazon SageMaker using Amazon FSx for Lustre and Amazon EFS file systems
- Git integration with Amazon SageMaker
- Build end-to-end machine learning workflows with Amazon SageMaker and Apache Airflow
- Amazon SageMaker automatic model tuning now supports random search and hyperparameter scaling
- Architecting ML with Amazon SageMaker 3 day course
- Classification of chest x-rays with Amazon SageMaker
AI Services
- Custom Classifier with Amazon Comprehend
- Amazon Rekognition custom labels
- Build a custom entity recognizer using Amazon Comprehend
- De-identify medical images with the help of Amazon Comprehend Medical and Amazon Rekognition
- Map clinical notes to the OMOP Common Data Model and healthcare ontologies using Amazon Comprehend Medical
Videos
- Artificial Intelligence and Machine Learning in Research
- Deep learning for disaster management and response
- End-to-end machine learning using Spark and Amazon SageMaker
- Amazon SageMaker deep dive: A modular solution for machine learning
- Insights into patient health with Amazon Comprehend Medical
- Build accurate training datasets with Amazon SageMaker Ground Truth
Tutorials
HPC on AWS
Learn about HPC services offered in AWS like cloud native scheduling with AWS Batch, using traditional job schedulers like Slurm with AWS ParallelCluster, and features like Elastic Fabric Adapter (EFA) to scale Message Passing Interface (MPI) and Machine Learning (ML) jobs in cloud HPC clusters.
General Information
- HPC on AWS
- AWS Batch Landing Page
- AWS Batch Documentation
- AWS Batch Getting Started Guide
- ParallelCluster Landing Page
- ParellelCluster Documentation
- ParallelCluster Getting Started Guide
Workshops
- Introduction to ParallelCluster
- ParallelCluster with FSx for Luster
- Introduction to AWS Batch with CARLA driving simulator
- Monte Carlo Simulations on AWS Batch with Spot
- AWS Batch Genomics Workflows with Cromwell and Nextflow
Blogs
- Building an interactive and scalable ML research environment using AWS ParallelCluster
- A Scientist’s Guide to Cloud-HPC: Example with AWS ParallelCluster, Slurm, Spack, and WRF
Videos
- AWS infrastructure for large-scale training at Facebook AI
- Enabling Research using Hybrid HPC Cloud Computing
Tutorials
HT Condor
Slurm
Container Services
AWS has a number of container offerings like Elastic Container Service (ECS) or Elastic Kubernetes Service (EKS). Learn about how to use managed Kubernetes for Machine Learning, Analytics, and HPC workloads.
General Information
- AWS Training & Certification Free Digital Training
- SageMaker Operators for Kubernetes
- EKS Worker Node Drainer
Workshops
- Everything EKS Workshop
- EKS and Kops Kubernetes Networking
- EKS with Kubeflow
- Apache Spark on EKS
- EKS with Terraform and Kubeflow
Blogs
- Help us write a new chapter for Gitops, Kubernetes, and Open Source collaboration
- Kubernetes workloads on EC2 Spot with EKS
- Using the FSx for Lustre CSI Driver with Amazon EKS
- Deploy Apache Spark jobs on EKS
- Optimizing Distributed Deep Learning Performance on Amazon EKS
Videos
- Running Kubernetes Applications on AWS Fargate
- EKS under the hood
- Running Kubernetes at Amazon scale using Amazon EKS
- Building machine-learning infrastructure on Amazon EKS with Kubeflow
- Top 5 container and Kubernetes best practices
- Running Containers in a Hybrid Environment