Name	Name	Last commit message	Last commit date
parent directory ..
SampleData	SampleData
dataset-validation	dataset-validation
distillation_recipes/01_citations	distillation_recipes/01_citations
Distillation-via-S3-input.ipynb	Distillation-via-S3-input.ipynb
Historical_invocation_distillation.ipynb	Historical_invocation_distillation.ipynb
README.md	README.md
utils.py	utils.py

Name

Last commit message

Last commit date

dataset-validation

distillation_recipes/01_citations

Distillation-via-S3-input.ipynb

Historical_invocation_distillation.ipynb

README.md

utils.py

Amazon Bedrock Model Distillation Samples

This repository contains code samples and notebooks demonstrating how to use Amazon Bedrock Model Distillation. The samples cover two main approaches for creating distillation jobs: using S3 to upload a JSONL file with prompts, and using historical invocation logs.

Introduction
Prerequisites
Notebooks
Usage
Key Benefits
Use Cases
Contributing

Introduction

Amazon Bedrock Model Distillation allows you to create smaller, faster, and more cost-efficient models that deliver use-case specific accuracy comparable to larger, more capable models. This repository provides practical examples of how to implement model distillation using Amazon Bedrock.

Prerequisites

Before using these samples, ensure you have:

An active AWS account
Selected teacher and student models enabled in Amazon Bedrock
Confirmed availability of model region and quotas
Created an IAM role with necessary permissions
Set up an Amazon S3 bucket for storing distillation job output metrics
Enabled invocation logging (if using historical invocation logs)
Sufficient quota for running provisioned throughput during inference

Notebooks

This repository contains two main notebooks:

Distillation-via-S3-input.ipynb: Demonstrates how to use S3 to upload a JSONL file with prompts for model distillation.
Historical_invocation_distillation.ipynb: Shows how to use historical invocation logs to create a distillation job, including generating invocation logs and metadata using ConverseAPI.

Usage

To use these notebooks:

Clone this repository
Open the desired notebook in a Jupyter environment
Follow the step-by-step instructions in each notebook

Ensure you have the necessary AWS permissions and have set up your environment according to the prerequisites.

Key Benefits

Efficiency: Distilled models provide high use-case specific accuracy comparable to the most capable models while being as fast as some of the smallest models.
Cost Optimization: Inference from distilled models is less expensive compared to larger advanced models.
Advanced Customization: Bedrock Model Distillation removes the need to create labelled dataset for fine-tuning.
Ease of Use: Bedrock Model Distillation offers a single workflow that automates the generation of teacher responses, addition of data synthesis, and fine-tunes the student model with optimized hyperparameter tuning.

Use Cases

Retrieval-Augmented Generation (RAG)
Document Summarization
Chatbot Deployments
Text Classification

Contributing

We welcome contributions to improve these samples. Please submit a pull request or open an issue to discuss proposed changes.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

Amazon Bedrock Model Distillation Samples

Table of Contents

Introduction

Prerequisites

Notebooks

Usage

Key Benefits

Use Cases

Contributing

FilesExpand file tree

model_distillation

Directory actions

More options

Directory actions

More options

Latest commit

History

model_distillation

Folders and files

parent directory

README.md

Amazon Bedrock Model Distillation Samples

Table of Contents

Introduction

Prerequisites

Notebooks

Usage

Key Benefits

Use Cases

Contributing