Commit 699f600

committed 2025-05-26
1 parent 3b70c2e commit 699f600

File tree

2 files changed: +29 −15 lines changed


apigw-lambda-bedrock-sam/README.md

Lines changed: 4 additions & 4 deletions

```diff
@@ -1,6 +1,6 @@
 # Amazon API Gateway to AWS Lambda to Amazon Bedrock using SAM
 
-This sample project deploys an Amazon API Gateway REST API with an AWS Lambda integration. The Lambda function is written in Python, calls the Amazon Bedrock API for Anthropic Claude-v2 model and returns a response containing the generated content.
+This sample project deploys an Amazon API Gateway REST API with an AWS Lambda integration. The Lambda function is written in Python, calls the Amazon Bedrock API for Anthropic Claude 3.5 Sonnet model and returns a response containing the generated content.
 
 Learn more about this pattern at Serverless Land Patterns: serverlessland.com/patterns/apigw-lambda-bedrock
 
@@ -100,16 +100,16 @@ Amazon Bedrock users need to request access to models before they are available
 
 ## How it works
 
-This SAM project uses Amazon Bedrock API for Anthropic Claude-v2 model to generate content based on given prompt. This is exposed through a serverless REST API. Please refer to the architecture diagram below:
+This SAM project uses Amazon Bedrock API for Anthropic Claude 3.5 Sonnet model to generate content based on given prompt. This is exposed through a serverless REST API. Please refer to the architecture diagram below:
 ![End to End Architecture](images/architecture.png)
 
 Here's a breakdown of the steps:
 
 1. **Amazon API Gateway**: Receives the HTTP POST request containing the prompt.
 
-2. **AWS Lambda**: Triggered by the API Gateway, this function forwards the prompt to Amazon Bedrock API using boto3 bedrock-runtime API. It uses Anthropic Claude-v2 model and sets other required parameters to fixed values for simplicity.
+2. **AWS Lambda**: Triggered by the API Gateway, this function forwards the prompt to Amazon Bedrock API using boto3 bedrock-runtime API. It uses Anthropic Claude 3.5 Sonnet model and sets parameters like temperature, max tokens, and other configuration options.
 
-3. **Amazon Bedrock**: Based on the given prompt, using Anthropic Claude-v2 model generates the content and returns the response to Lambda.
+3. **Amazon Bedrock**: Based on the given prompt, using Anthropic Claude 3.5 Sonnet model generates the content and returns the response to Lambda.
 
 4. **Response**: Lambda processes the Bedrock output and sends it back to the user via the API Gateway.
```
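The core of this commit is the move from the legacy Claude-v2 text-completions request format to the Claude 3.5 Messages API format. A minimal sketch of the two payload shapes, using the parameter values from the diff (the `"Tell me a joke"` prompt is illustrative, not from the source):

```python
import json

# Legacy Claude-v2 text-completions payload (pre-commit format)
legacy_body = json.dumps({
    "prompt": "\n\nHuman:" + "Tell me a joke" + "\n\nAssistant:",
    "temperature": 0.5,
    "top_p": 1,
    "top_k": 250,
    "max_tokens_to_sample": 200,
    "stop_sequences": ["\n\nHuman:"]
})

# Claude 3.5 Sonnet Messages API payload (post-commit format):
# the prompt moves into a structured messages list, max_tokens is
# renamed, and an anthropic_version field is required
messages_body = json.dumps({
    "anthropic_version": "bedrock-2023-05-31",
    "max_tokens": 200,
    "messages": [
        {"role": "user", "content": "Tell me a joke"}
    ],
    "temperature": 0.5,
    "top_p": 1,
    "top_k": 250
})
```

Note that `stop_sequences` and the `\n\nHuman:`/`\n\nAssistant:` framing are dropped entirely: the Messages API handles turn boundaries itself.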

apigw-lambda-bedrock-sam/src/bedrock_integration.py

Lines changed: 25 additions & 11 deletions

```diff
@@ -13,32 +13,46 @@ def lambda_handler(event, context):
     prompt = body["prompt"]
     print("Prompt = " + prompt)
 
-    # Create the body
-    body = json.dumps({
-        'prompt': "\n\nHuman:" + prompt + "\n\nAssistant:",
+    # Create the request body using the messaging API format for Claude 3.5
+    request_body = json.dumps({
+        "anthropic_version": "bedrock-2023-05-31",
+        "max_tokens": 200,
+        "messages": [
+            {
+                "role": "user",
+                "content": prompt
+            }
+        ],
         "temperature": 0.5,
         "top_p": 1,
-        "top_k": 250,
-        "max_tokens_to_sample": 200,
-        "stop_sequences": ["\n\nHuman:"]
+        "top_k": 250
     })
 
     # Set the model id and other parameters required to invoke the model
-    model_id = 'anthropic.claude-v2'
+    model_id = 'anthropic.claude-3-5-sonnet-20240620-v1:0'
     accept = 'application/json'
     content_type = 'application/json'
 
     # Invoke Bedrock API
-    response = bedrock.invoke_model(body=body, modelId=model_id, accept=accept, contentType=content_type)
-    print(response)
-
+    response = bedrock.invoke_model(body=request_body, modelId=model_id, accept=accept, contentType=content_type)
+
     # Parse the response body
     response_body = json.loads(response.get('body').read())
     print(response_body)
+
+    # Extract the completion from the messaging API response format
+    completion = response_body.get('content', [{}])[0].get('text', '')
+    stop_reason = response_body.get('stop_reason', '')
+
+    # Format the response to maintain backward compatibility with the existing API
+    formatted_response = {
+        'completion': completion,
+        'stop_reason': stop_reason
+    }
 
     return {
         'statusCode': 200,
         'body': json.dumps({
-            'generated-text': response_body
+            'generated-text': formatted_response
        })
    }
```
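The new response-parsing logic can be exercised without calling Bedrock by feeding it a payload in the Messages API response shape. A sketch, with the extraction pulled into a standalone helper and an illustrative sample payload (the `format_response` name and the sample values are ours, not from the commit):

```python
def format_response(response_body):
    # Mirrors the extraction logic added in bedrock_integration.py:
    # take the first content block's text and the stop reason, and
    # wrap them in the shape the old Claude-v2 API returned
    completion = response_body.get('content', [{}])[0].get('text', '')
    stop_reason = response_body.get('stop_reason', '')
    return {'completion': completion, 'stop_reason': stop_reason}

# Illustrative Messages API response shape
sample = {
    "content": [{"type": "text", "text": "Hello!"}],
    "stop_reason": "end_turn"
}
result = format_response(sample)
print(result)  # {'completion': 'Hello!', 'stop_reason': 'end_turn'}
```

The defaults in the two `get` calls make the helper degrade gracefully: an empty or malformed response yields empty strings rather than a `KeyError`, which keeps the Lambda's 200 response path intact.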
