# Trace-Based Testing for Chatbot RAG Application

*The instructions and tests below have been used with models hosted by OpenAI.
We plan to add tests for additional model configurations in the future.*

## Introduction to Trace Testing

Trace-based testing is a modern approach to testing distributed systems that leverages the distributed traces flowing through your applications. In a complex system like the chatbot RAG application, traditional testing approaches often fall short because they cannot effectively verify the interactions between microservices, databases, and external APIs.

Tracetest is an open-source tool that enables you to create, run, and maintain integration tests using distributed traces, with support for OpenTelemetry and observability backends such as Elastic APM. It allows you to:

- Validate the flow of requests through your entire system
- Assert on specific spans within a trace
- Test complex scenarios involving multiple services

For more information about Tracetest, visit the [official documentation](https://docs.tracetest.io/).
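
To give a flavor of what this looks like in practice, a Tracetest test is defined declaratively in YAML (the test used here lives under `resources/`) and executed against a running Tracetest server. The commands below are only a sketch: the bundled `run-tests.sh` script described later drives everything for you, and CLI flag names can vary between Tracetest versions.

```bash
# Sketch only: flag names can vary between Tracetest CLI versions, and the
# bundled run-tests.sh script handles all of this for you.
tracetest configure --server-url http://localhost:11633
tracetest run test --file resources/openai-chatbot-test.yaml
```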

## Setup

The chatbot RAG application test setup uses Docker to create a testing environment that includes:

1. A Tracetest server for executing and managing tests
2. An Elasticsearch cluster for storing traces, logs, and application data
3. An OpenTelemetry collector for processing and routing telemetry data
4. The chatbot RAG application itself

The setup leverages several Docker Compose files to combine the test environment with the local Elastic Stack (from [docker/docker-compose-elastic.yml](../../../../docker/docker-compose-elastic.yml)) and the chatbot RAG application (from [example-apps/chatbot-rag-app/docker-compose.yml](../../docker-compose.yml)). To bring up current versions of all the moving parts, we layer in overrides maintained in this directory (a sketch of how the files fit together follows below):

- `docker-compose.test.yml` - base Tracetest configuration
- `docker-compose.test.override.yml` - test-specific Tracetest configuration
- `elastic-stack.override.yml` - test-specific configuration for Elasticsearch and the OpenTelemetry Collector
- `chatbot-rag.override.yml` - configuration for the chatbot application in test mode

All services are connected through a shared Docker network to enable communication between components.
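
Purely as an illustration of how these files fit together, the layering looks roughly like the commands below. The `run-tests.sh` script covered in the Running the Tests section performs this wiring for you; the network name here is a placeholder and the exact invocation in the script may differ.

```bash
# Rough sketch of the compose layering; run-tests.sh is the source of truth
# and "elastic-net" is only a placeholder network name.
docker network create elastic-net || true

# Elastic Stack (Elasticsearch and OpenTelemetry Collector) with test overrides
docker compose -f ../../../../docker/docker-compose-elastic.yml -f elastic-stack.override.yml up -d

# Chatbot RAG application in test mode
docker compose -f ../../docker-compose.yml -f chatbot-rag.override.yml up -d --build

# Tracetest server and its test-specific configuration
docker compose -f docker-compose.test.yml -f docker-compose.test.override.yml up -d
```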

## Environment Configuration

Before running tests, you need to prepare a `.env.test` file with the necessary environment variables. This file configures the behavior of the chatbot application during testing (the same configuration as described in [the application's directory](../../README.md)).

Create a `.env.test` file in the `test/tracetest` directory with the following content to reproduce the environment we're testing with:

```bash
# Location of the application routes
FLASK_APP=api/app.py
# Ensure print statements appear as they happen
PYTHONUNBUFFERED=1

# How you connect to Elasticsearch: change details to your instance
ELASTICSEARCH_URL=http://elasticsearch:9200
ELASTICSEARCH_USER=elastic
ELASTICSEARCH_PASSWORD=elastic

# The names of the Elasticsearch indexes
ES_INDEX=workplace-app-docs
ES_INDEX_CHAT_HISTORY=workplace-app-docs-chat-history

# OpenAI Configuration
LLM_TYPE=openai
OPENAI_API_KEY=
CHAT_MODEL=gpt-4o-mini

# Set to false to record logs, traces and metrics
OTEL_SDK_DISABLED=false

# Assign the service name that shows up in Kibana
OTEL_SERVICE_NAME=chatbot-rag-app

# OpenTelemetry configuration
OTEL_EXPORTER_OTLP_ENDPOINT=http://otel-collector:4318
OTEL_EXPORTER_OTLP_PROTOCOL=http/protobuf
OTEL_INSTRUMENTATION_GENAI_CAPTURE_MESSAGE_CONTENT=true

# Performance tuning
OTEL_METRIC_EXPORT_INTERVAL=3000
OTEL_BSP_SCHEDULE_DELAY=3000
OTEL_EXPERIMENTAL_RESOURCE_DETECTORS=process_runtime,os,otel,telemetry_distro
```
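
If you want to catch a missing key before spinning everything up, an optional check such as the one below can be run from the `test/tracetest` directory. It is not part of the provided tooling, just a convenience:

```bash
# Optional convenience check (not part of the provided tooling):
# warn if OPENAI_API_KEY was left empty in .env.test.
if grep -qE '^OPENAI_API_KEY=[[:space:]]*$' .env.test; then
  echo "OPENAI_API_KEY is empty in .env.test; add your key before running the tests." >&2
fi
```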

> Note: Make sure to set `OPENAI_API_KEY` to your actual OpenAI API key.

## Running the Tests

To run the trace-based tests for the chatbot RAG application, follow these steps:

1. Navigate to the test directory:

   ```bash
   cd example-apps/chatbot-rag-app/test/tracetest
   ```

2. Execute the test script:

   ```bash
   ./run-tests.sh
   ```

   To automatically clean up resources after the tests complete (or if they fail), use the `--with-cleanup` flag:

   ```bash
   ./run-tests.sh --with-cleanup
   ```

The script performs the following operations:

- Creates a shared Docker network for all services
- Sets up the Tracetest server
- Starts the Elastic Stack (Elasticsearch and OpenTelemetry Collector)
- Builds and starts the chatbot RAG application
- Executes the trace tests defined in `resources/openai-chatbot-test.yaml`
- If `--with-cleanup` is provided, automatically cleans up all resources when the script exits, whether normally or due to an error (a sketch of this pattern follows below)
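
The cleanup behavior follows the usual shell pattern of registering a teardown function on exit. The sketch below shows that pattern under the assumption that teardown means bringing the compose stacks down; the actual `run-tests.sh` may differ in detail.

```bash
#!/usr/bin/env bash
# Sketch of the cleanup-on-exit pattern; the real run-tests.sh may differ.
set -euo pipefail

cleanup() {
  # Hypothetical teardown: bring down the stacks described above.
  docker compose -f docker-compose.test.yml -f docker-compose.test.override.yml down -v || true
  docker compose -f ../../docker-compose.yml -f chatbot-rag.override.yml down -v || true
  docker compose -f ../../../../docker/docker-compose-elastic.yml -f elastic-stack.override.yml down -v || true
}

# With --with-cleanup, run the teardown whether the script exits normally or on error.
if [[ "${1:-}" == "--with-cleanup" ]]; then
  trap cleanup EXIT
fi
```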

The example test sends a question about the work-from-home policy to the LLM via the API and validates several aspects of the application (an illustrative version of that request is sketched after this list):

- Successful interaction with the LLM (in the initial setup, `gpt-4o-mini` via the OpenAI API)
- Proper search operations in Elasticsearch for RAG functionality
- Correct updating of chat history
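
For reference, the request the test issues looks roughly like the call below. The endpoint path, port, and payload shape are assumptions about the locally running app, not taken from the test file; the authoritative request and assertions live in `resources/openai-chatbot-test.yaml`.

```bash
# Illustrative only: endpoint, port, and payload shape are assumptions about the
# local chatbot app; see resources/openai-chatbot-test.yaml for the real request.
curl -s -X POST http://localhost:4000/api/chat \
  -H 'Content-Type: application/json' \
  -d '{"question": "What is the work from home policy?", "session_id": "test-session-1"}'
```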