Generated with '
+ f'{_escape_html(str(summary_intro.get("model_deployment") or "configured model"))} on '
+ f'{_escape_html(str(summary_intro.get("generated_at", "")))}
'
+ )
+ if message.get('timestamp'):
+ parts.append(
+ f'
{_escape_html(str(message.get("timestamp")))}
'
+ )
+ parts.append('')
+ for thought in message.get('thoughts', []):
+ thought_label = (thought.get('step_type') or 'step').replace('_', ' ').title()
+ parts.append(
+ f'
{_escape_html(thought_label)}: '
+ f'{_escape_html(str(thought.get("content") or "No content recorded."))}'
+ )
+ if thought.get('duration_ms') is not None:
+ parts.append(
+ f' Duration: {thought.get("duration_ms")} ms'
+ )
+ if thought.get('timestamp'):
+ parts.append(
+ f' Timestamp: '
+ f'{_escape_html(str(thought.get("timestamp")))}'
+ )
+ if thought.get('detail'):
+ parts.append(' Detail:')
+ _append_html_code_block(parts, thought.get('detail'))
+ parts.append('
'
+ )
+ if message.get('timestamp'):
+ parts.append(
+ f'
{_escape_html(str(message.get("timestamp")))}
'
+ )
+ content = message.get('content_text', '') or 'No content recorded.'
+ content_html = markdown2.markdown(
+ content,
+ extras=['fenced-code-blocks', 'tables', 'break-on-newline']
+ )
+ parts.append(content_html)
+
+ return '\n'.join(parts)
+
+
def _render_pdf_bytes(body_html: str) -> bytes:
    """Render an HTML body to PDF bytes via the PyMuPDF Story API.

    Content is laid out on US Letter pages with 0.5-inch (36 pt) margins;
    pagination continues until the Story reports that all content has
    been placed.
    """
    page_rect = fitz.paper_rect("letter")
    # Shrink the page rectangle by 36 pt on every edge for the margins.
    content_rect = page_rect + (36, 36, -36, -36)

    story = fitz.Story(html=body_html, user_css=_PDF_CSS)

    tmp_path = None
    try:
        # DocumentWriter writes to a filesystem path, so stage the PDF in
        # a temporary file and read the bytes back once rendering is done.
        with tempfile.NamedTemporaryFile(suffix='.pdf', delete=False) as tmp:
            tmp_path = tmp.name

        writer = fitz.DocumentWriter(tmp_path)
        while True:
            device = writer.begin_page(page_rect)
            more, _ = story.place(content_rect)
            story.draw(device)
            writer.end_page()
            if not more:
                break
        writer.close()
        # Drop references promptly so PyMuPDF releases native resources.
        del story
        del writer

        with open(tmp_path, 'rb') as f:
            return f.read()
    finally:
        # Best-effort cleanup of the staging file.
        if tmp_path:
            try:
                os.unlink(tmp_path)
            except OSError:
                pass
+
+
def _conversation_to_pdf_bytes(entry: Dict[str, Any]) -> bytes:
    """Render a single conversation export entry as PDF bytes.

    Builds the HTML body for the entry and hands it to the shared
    Story-based PDF renderer.
    """
    return _render_pdf_bytes(_build_pdf_html_body(entry))
+
+
def _html_body_to_pdf_bytes(body_html: str) -> bytes:
    """Convert raw HTML body content to PDF bytes.

    Thin public wrapper around the shared Story-based renderer so callers
    that already hold assembled HTML (rather than a conversation export
    entry) can produce a PDF without touching rendering details.
    """
    return _render_pdf_bytes(body_html)
+
+
def _append_html_table(parts: List[str], mapping: Dict[str, Any]):
    """Append a key-value mapping as an HTML table.

    Mutates ``parts`` in place by appending HTML fragments; emits a
    placeholder paragraph when ``mapping`` is empty or not a dict.

    NOTE(review): the HTML tag text inside the string literals below
    (e.g. table/row/cell wrappers) appears to have been stripped from
    this view of the file — confirm against the original source before
    relying on the literal contents shown here.
    """
    if not isinstance(mapping, dict) or not mapping:
        parts.append('
No data available.
')
        return

    parts.append('
')
    parts.append('
Property
Value
')
    # One table row per key/value pair; values are formatted by type.
    for key, value in mapping.items():
        label = _format_markdown_key(key)
        if isinstance(value, dict):
            # Nested dicts are flattened into a single-cell HTML string.
            formatted = _format_nested_html_value(value)
        elif isinstance(value, list):
            # Lists become a comma-separated, escaped string; empty -> 'None'.
            formatted = (
                ', '.join(_escape_html(str(item)) for item in value)
                if value else 'None'
            )
        elif isinstance(value, bool):
            # Booleans render as human-friendly Yes/No.
            formatted = 'Yes' if value else 'No'
        else:
            formatted = _escape_html(str(value))
        parts.append(f'
{_escape_html(label)}
{formatted}
')
    parts.append('
')
+
+
def _format_nested_html_value(mapping: Dict[str, Any], depth: int = 0) -> str:
    """Render a nested dictionary as a flat HTML snippet for table cells.

    Recurses into nested dicts (tracking ``depth``), joins list values
    with commas, maps booleans to Yes/No, and HTML-escapes all labels
    and scalar values. An empty mapping renders as ``'None'``.
    """
    if not mapping:
        return 'None'

    rendered = []
    for raw_key, raw_value in mapping.items():
        key_text = _escape_html(_format_markdown_key(raw_key))
        if isinstance(raw_value, dict):
            # Recurse for nested mappings; result is already escaped.
            value_text = _format_nested_html_value(raw_value, depth + 1)
        elif isinstance(raw_value, list):
            if raw_value:
                value_text = ', '.join(_escape_html(str(v)) for v in raw_value)
            else:
                value_text = 'None'
        elif isinstance(raw_value, bool):
            value_text = 'Yes' if raw_value else 'No'
        else:
            value_text = _escape_html(str(raw_value))
        rendered.append(f'{key_text}: {value_text}')
    return ' '.join(rendered)
+
+
+def _append_html_citations(parts: List[str], message: Dict[str, Any]):
+ """Append citation data as HTML."""
+ citations = message.get('citations', [])
+ if not citations:
+ parts.append('
No citations were recorded for this message.
')
+ return
+
+ doc_citations = [c for c in citations if c.get('citation_type') == 'document']
+ web_citations = [c for c in citations if c.get('citation_type') == 'web']
+ agent_citations = [c for c in citations if c.get('citation_type') == 'agent_tool']
+ legacy_citations = [c for c in citations if c.get('citation_type') == 'legacy']
+
+ if doc_citations:
+ parts.append('
Document Sources
')
+ parts.append('')
+ for citation in doc_citations:
+ parts.append(
+ f'
')
+ parts.append('')
+
+ if web_citations:
+ parts.append('
Web Sources
')
+ parts.append('')
+ for citation in web_citations:
+ title = _escape_html(
+ str(citation.get('title') or citation.get('label') or 'Web source')
+ )
+ url = citation.get('url')
+ if url:
+ parts.append(f'
';
+ });
+}
+
+/**
+ * Render a list of thought steps as HTML.
+ * @param {Array} thoughts
+ * @returns {string} HTML string
+ */
+function renderThoughtsList(thoughts) {
+ let html = '
+
+
+
+ Maximum blob size (in MB) allowed for tabular file previews (CSV, XLSX). Files larger than this will not be previewed.
+ Increase for larger files if your compute has sufficient memory, or decrease to protect smaller instances. Default: 200 MB.
+
+
return d.innerHTML;
}
/**
 * Build display HTML for a group's tag names, limited to maxDisplay
 * badges with a "+N" overflow counter for the remainder.
 *
 * NOTE(review): the badge wrapper markup (the elements that would use
 * `textClass` and `color`) appears to have been stripped from this view
 * of the template literals below — confirm against the original file.
 *
 * @param {string[]} tags - tag names attached to the group item
 * @param {number} [maxDisplay=3] - maximum number of badges rendered inline
 * @returns {string} HTML string, or placeholder text when no tags exist
 */
function renderGroupTagBadges(tags, maxDisplay = 3) {
    if (!Array.isArray(tags) || tags.length === 0) {
        return 'No tags';
    }

    let html = '';
    // Only the first maxDisplay tags are rendered inline.
    const displayTags = tags.slice(0, maxDisplay);

    displayTags.forEach(tagName => {
        // Look up the workspace tag to recover its configured color;
        // fall back to a neutral gray when the tag is unknown.
        const tag = groupWorkspaceTags.find(t => t.name === tagName);
        const color = tag && tag.color ? tag.color : '#6c757d';
        // Choose a readable text class against the badge background.
        const textClass = isGroupColorLight(color) ? 'text-dark' : 'text-light';

        html += `${escapeGroupHtml(tagName)}`;
    });

    // Summarize any remaining tags as a "+N" counter.
    if (tags.length > maxDisplay) {
        html += `+${tags.length - maxDisplay}`;
    }

    return html;
}
+
// --- Tag Management Modal ---
function showGroupTagManagementModal() {
loadGroupWorkspaceTags().then(() => {
diff --git a/application/single_app/templates/index.html b/application/single_app/templates/index.html
index 7a146e0d..c3c2abc6 100644
--- a/application/single_app/templates/index.html
+++ b/application/single_app/templates/index.html
@@ -62,8 +62,7 @@
{% else %}
{% if session.get('user') %}
- You are logged in but do not have the required permissions to access this application.
- Please submit a ticket to request access.
+ {{ app_settings.access_denied_message | nl2br }}
Setting App Service Container Image ..."
+ # This deployer uses a container-based App Service.
+ # Gunicorn startup is handled by the Dockerfile ENTRYPOINT inside the image,
+ # so App Service native Python startup settings are not configured here.
# az webapp config container set `
# --name $appServiceName `
# --resource-group $resourceGroupName `
diff --git a/deployers/bicep/README.md b/deployers/bicep/README.md
index 5c50ed9f..8a3a1d0a 100644
--- a/deployers/bicep/README.md
+++ b/deployers/bicep/README.md
@@ -57,6 +57,17 @@ Ensure the following resource providers are registered in your subscription:
## Deployment Process
+## Runtime Startup Behavior
+
+- This deployer publishes a **container image** to Azure App Service.
+- Gunicorn is started by the container entrypoint in `application/single_app/Dockerfile`.
+- Do **not** add an App Service Stack Settings Startup command for this deployer unless you intentionally change the deployment model away from containers.
+- If you later switch to a native Python App Service deployment, deploy the `application/single_app` folder and use this startup command instead:
+
+```bash
+python -m gunicorn -c gunicorn.conf.py app:app
+```
+
The below steps cover the process to deploy the Simple Chat application to an Azure Subscription. It is assumed the user has administrative rights to the subscription for deployment. If the user does not also have permissions to create an Application Registration in Entra, a stand-alone script can be provided to an administrator with the correct permissions.
### Pre-Configuration:
@@ -358,7 +369,7 @@ A: Base infrastructure (without optional services) costs approximately:
### Upgrading
**Q: How do I upgrade to a new version?**
-A: Run `azd up` again from the updated codebase. Use `azd provision --preview` to review changes first.
+A: For **code-only** container updates, prefer `azd deploy`. Use `azd provision --preview` and then `azd up` only when the release also changes infrastructure. See [../../docs/how-to/upgrade_paths.md](../../docs/how-to/upgrade_paths.md) for the upgrade decision guide.
---
diff --git a/deployers/bicep/main.bicep b/deployers/bicep/main.bicep
index bcf19d31..6adc0d6a 100644
--- a/deployers/bicep/main.bicep
+++ b/deployers/bicep/main.bicep
@@ -6,13 +6,18 @@ targetScope = 'subscription'
param location string
@description('''The target Azure Cloud environment.
-- Accepted values are: AzureCloud, AzureUSGovernment
-- Default is AzureCloud''')
+- Accepted values are: AzureCloud, AzureUSGovernment, public, usgovernment, custom
+- Default is based on the ARM cloud name''')
@allowed([
- 'AzureCloud'
- 'AzureUSGovernment'
+ 'AzureCloud' // public, keep allowed values for backwards compatibility
+ 'AzureUSGovernment' // usgovernment
+ 'public'
+ 'usgovernment'
+ 'custom'
])
-param cloudEnvironment string
+param cloudEnvironment string = az.environment().name == 'AzureCloud' ? 'public' : (az.environment().name == 'AzureUSGovernment' ? 'usgovernment' : 'custom')
+// SimpleChat expects public, usgovernment or custom
+var scCloudEnvironment = cloudEnvironment == 'AzureCloud' ? 'public' : (cloudEnvironment == 'AzureUSGovernment' ? 'usgovernment' : cloudEnvironment)
@description('''The name of the application to be deployed.
- Name may only contain letters and numbers
@@ -80,6 +85,20 @@ param enableDiagLogging bool
- Default is false''')
param enablePrivateNetworking bool
+// --- Custom Azure Environment Parameters (for 'custom' azureEnvironment) ---
+@description('Custom blob storage URL suffix, e.g. blob.core.usgovcloudapi.net')
+param customBlobStorageSuffix string = 'blob.${az.environment().suffixes.storage}'
+@description('Custom Graph API URL, e.g. https://graph.microsoft.us')
+param customGraphUrl string? // az.environment().graph is legacy AD, do not use
+@description('Custom Identity URL, e.g. https://login.microsoftonline.us/')
+param customIdentityUrl string = az.environment().authentication.loginEndpoint
+@description('Custom Resource Manager URL, e.g. https://management.usgovcloudapi.net')
+param customResourceManagerUrl string = az.environment().resourceManager
+@description('Custom Cognitive Services scope ex: https://cognitiveservices.azure.com/.default')
+param customCognitiveServicesScope string = 'https://cognitiveservices.azure.com/.default'
+@description('Custom search resource URL for token audience, e.g. https://search.azure.us')
+param customSearchResourceUrl string = 'https://search.azure.com'
+
@description('''Array of GPT model names to deploy to the OpenAI resource.''')
param gptModels array = [
{
@@ -424,7 +443,7 @@ module appService 'modules/appService.bicep' = {
logAnalyticsId: logAnalytics.outputs.logAnalyticsId
appServicePlanId: appServicePlan.outputs.appServicePlanId
containerImageName: containerImageName
- azurePlatform: cloudEnvironment
+ azurePlatform: scCloudEnvironment
cosmosDbName: cosmosDB.outputs.cosmosDbName
searchServiceName: searchService.outputs.searchServiceName
openAiServiceName: openAI.outputs.openAIName
@@ -439,6 +458,14 @@ module appService 'modules/appService.bicep' = {
enablePrivateNetworking: enablePrivateNetworking
#disable-next-line BCP318 // expect one value to be null if private networking is disabled
appServiceSubnetId: enablePrivateNetworking? virtualNetwork.outputs.appServiceSubnetId : ''
+
+ // --- Custom Azure Environment Parameters (for 'custom' azureEnvironment) ---
+ customBlobStorageSuffix: customBlobStorageSuffix
+ customGraphUrl: customGraphUrl
+ customIdentityUrl: customIdentityUrl
+ customResourceManagerUrl: customResourceManagerUrl
+ customCognitiveServicesScope: customCognitiveServicesScope
+ customSearchResourceUrl: customSearchResourceUrl
}
}
diff --git a/deployers/bicep/modules/appService.bicep b/deployers/bicep/modules/appService.bicep
index 5d9aa471..14f251b7 100644
--- a/deployers/bicep/modules/appService.bicep
+++ b/deployers/bicep/modules/appService.bicep
@@ -27,6 +27,23 @@ param keyVaultUri string
param enablePrivateNetworking bool
param appServiceSubnetId string = ''
+// --- Custom Azure Environment Parameters (for 'custom' azureEnvironment) ---
+@description('Custom blob storage URL suffix, e.g. blob.core.usgovcloudapi.net')
+param customBlobStorageSuffix string?
+@description('Custom Graph API URL, e.g. https://graph.microsoft.us')
+param customGraphUrl string?
+@description('Custom Identity URL, e.g. https://login.microsoftonline.us')
+param customIdentityUrl string?
+@description('Custom Resource Manager URL, e.g. https://management.usgovcloudapi.net')
+param customResourceManagerUrl string?
+@description('Custom Cognitive Services scope ex: https://cognitiveservices.azure.com/.default')
+param customCognitiveServicesScope string?
+@description('Custom search resource URL for token audience, e.g. https://search.azure.us')
+param customSearchResourceUrl string?
+
+var tenantId = tenant().tenantId
+var openIdMetadataUrl = '${az.environment().authentication.loginEndpoint}${tenantId}/v2.0/.well-known/openid-configuration'
+
// Import diagnostic settings configurations
module diagnosticConfigs 'diagnosticSettings.bicep' = if (enableDiagLogging) {
name: 'diagnosticConfigs'
@@ -55,12 +72,15 @@ resource appInsights 'Microsoft.Insights/components@2020-02-02' existing = {
name: appInsightsName
}
-var acrDomain = azurePlatform == 'AzureUSGovernment' ? '.azurecr.us' : '.azurecr.io'
+var acrDomain = az.environment().suffixes.acrLoginServer
// add web app
resource webApp 'Microsoft.Web/sites@2022-03-01' = {
name: toLower('${appName}-${environment}-app')
location: location
+ // This module deploys a Linux container App Service.
+ // Gunicorn startup comes from the container image entrypoint,
+ // so App Service native Python startup settings are not used here.
kind: 'app,linux,container'
properties: {
serverFarmId: appServicePlanId
@@ -77,7 +97,7 @@ resource webApp 'Microsoft.Web/sites@2022-03-01' = {
ftpsState: 'Disabled'
healthCheckPath: '/external/healthcheck'
appSettings: [
- { name: 'AZURE_ENDPOINT', value: azurePlatform == 'AzureUSGovernment' ? 'usgovernment' : 'public' }
+ { name: 'AZURE_ENVIRONMENT', value: azurePlatform }
{ name: 'SCM_DO_BUILD_DURING_DEPLOYMENT', value: 'false' }
{ name: 'AZURE_COSMOS_ENDPOINT', value: cosmosDb.properties.documentEndpoint }
{ name: 'AZURE_COSMOS_AUTHENTICATION_TYPE', value: toLower(authenticationType) }
@@ -150,8 +170,18 @@ resource webApp 'Microsoft.Web/sites@2022-03-01' = {
{ name: 'InstrumentationEngine_EXTENSION_VERSION', value: 'disabled' }
{ name: 'SnapshotDebugger_EXTENSION_VERSION', value: 'disabled' }
{ name: 'XDT_MicrosoftApplicationInsights_BaseExtensions', value: 'disabled' }
- { name: 'XDT_MicrosoftApplicationInsights_Mode', value: 'recommended' }
- { name: 'XDT_MicrosoftApplicationInsights_PreemptSdk', value: 'disabled' }
+ {name: 'XDT_MicrosoftApplicationInsights_Mode', value: 'recommended' }
+ {name: 'XDT_MicrosoftApplicationInsights_PreemptSdk', value: 'disabled' }
+ ...(azurePlatform == 'custom' ? [
+ {name: 'CUSTOM_GRAPH_URL_VALUE', value: customGraphUrl ?? ''}
+ {name: 'CUSTOM_IDENTITY_URL_VALUE', value: customIdentityUrl ?? ''}
+ {name: 'CUSTOM_RESOURCE_MANAGER_URL_VALUE', value: customResourceManagerUrl ?? ''}
+ {name: 'CUSTOM_BLOB_STORAGE_URL_VALUE', value: customBlobStorageSuffix ?? ''}
+ {name: 'CUSTOM_COGNITIVE_SERVICES_URL_VALUE', value: customCognitiveServicesScope ?? ''}
+ {name: 'CUSTOM_SEARCH_RESOURCE_MANAGER_URL_VALUE', value: customSearchResourceUrl ?? ''}
+ {name: 'KEY_VAULT_DOMAIN', value: az.environment().suffixes.keyvaultDns}
+ {name: 'CUSTOM_OIDC_METADATA_URL_VALUE', value: openIdMetadataUrl ?? ''}]
+ : [])
]
}
clientAffinityEnabled: false
@@ -205,7 +235,7 @@ resource authSettings 'Microsoft.Web/sites/config@2022-03-01' = {
azureActiveDirectory: {
enabled: true
registration: {
- openIdIssuer: azurePlatform == 'AzureUSGovernment' ? 'https://login.microsoftonline.us/${tenant().tenantId}/' : 'https://sts.windows.net/${tenant().tenantId}/'
+ openIdIssuer: '${az.environment().authentication.loginEndpoint}${tenant().tenantId}/'
clientId: enterpriseAppClientId
clientSecretSettingName: 'MICROSOFT_PROVIDER_AUTHENTICATION_SECRET'
}
diff --git a/deployers/terraform/ReadMe.md b/deployers/terraform/ReadMe.md
index fb2dcf7d..acb2bc74 100644
--- a/deployers/terraform/ReadMe.md
+++ b/deployers/terraform/ReadMe.md
@@ -39,6 +39,12 @@ ACR_PASSWORD = "your_acr_password"
From Github > Actions > "SimpleChat Docker Image Publish" > Run workflow
+## Upgrading
+
+- For **code-only** container updates, publish a new image to ACR and follow the existing App Service container rollout process instead of rerunning Terraform for every release.
+- Use Terraform when you are intentionally changing infrastructure or configuration that belongs in Terraform state.
+- See [../../docs/how-to/upgrade_paths.md](../../docs/how-to/upgrade_paths.md) for the native-vs-container upgrade guide and the ACR/image-only rollout notes.
+
## Terraform deployment
Initialize: Run terraform init to download the necessary providers.
diff --git a/deployers/terraform/main.tf b/deployers/terraform/main.tf
index 77b486df..77eb1e75 100644
--- a/deployers/terraform/main.tf
+++ b/deployers/terraform/main.tf
@@ -360,6 +360,9 @@ resource "azurerm_linux_web_app" "app" {
location = azurerm_resource_group.rg.location
resource_group_name = azurerm_resource_group.rg.name
service_plan_id = azurerm_service_plan.asp.id
+ # This Terraform deployer uses a container-based Linux Web App.
+ # Gunicorn startup comes from the container image entrypoint,
+ # so native Python app_command_line/startup settings are not used here.
ftp_publish_basic_authentication_enabled = false
webdeploy_publish_basic_authentication_enabled = false
diff --git a/custom-ca-certificates/.gitkeep b/docker-customization/custom-ca-certificates/.gitkeep
similarity index 100%
rename from custom-ca-certificates/.gitkeep
rename to docker-customization/custom-ca-certificates/.gitkeep
diff --git a/docker-customization/pip.conf b/docker-customization/pip.conf
new file mode 100644
index 00000000..3dc81272
--- /dev/null
+++ b/docker-customization/pip.conf
@@ -0,0 +1 @@
+# Add pip configuration here
\ No newline at end of file
diff --git a/docs/Gemfile b/docs/Gemfile
index 4ca5f8aa..478cf7f3 100644
--- a/docs/Gemfile
+++ b/docs/Gemfile
@@ -5,6 +5,7 @@ gem "github-pages", group: :jekyll_plugins
# Ruby 3+ compatibility
gem "webrick", "~> 1.8"
+gem "json", ">= 2.19.2"
# Jekyll plugins for enhanced functionality
group :jekyll_plugins do
diff --git a/docs/Gemfile.lock b/docs/Gemfile.lock
index 9dede6d7..4af17774 100644
--- a/docs/Gemfile.lock
+++ b/docs/Gemfile.lock
@@ -220,7 +220,7 @@ GEM
gemoji (>= 3, < 5)
html-pipeline (~> 2.2)
jekyll (>= 3.0, < 5.0)
- json (2.15.0)
+ json (2.19.2)
kramdown (2.4.0)
rexml
kramdown-parser-gfm (1.1.0)
@@ -287,6 +287,7 @@ DEPENDENCIES
jekyll-seo-tag
jekyll-sitemap
jekyll-titles-from-headings
+ json (>= 2.19.2)
webrick (~> 1.8)
BUNDLED WITH
diff --git a/docs/explanation/features/CHAT_COMPLETION_NOTIFICATIONS.md b/docs/explanation/features/CHAT_COMPLETION_NOTIFICATIONS.md
new file mode 100644
index 00000000..871ae2b0
--- /dev/null
+++ b/docs/explanation/features/CHAT_COMPLETION_NOTIFICATIONS.md
@@ -0,0 +1,103 @@
+# Chat Completion Notifications (v0.239.128)
+
+## Overview
+Chat completion notifications add background awareness for personal chat conversations. When a streamed assistant response finishes after the user has moved away from the chat, the app now creates a personal notification that deep-links back to the exact conversation and shows a green unread dot in both conversation lists until the conversation is opened.
+
+**Version Implemented:** 0.239.128
+
+## Dependencies
+- Flask chat and conversation routes
+- Cosmos DB conversations and notifications containers
+- Existing notification platform in `functions_notifications.py`
+- Chat SSE finalization in `static/js/chat/chat-streaming.js`
+- Main and sidebar conversation list modules
+
+## Implemented in version: **0.239.128**
+
+## Architecture Overview
+
+### Backend
+- **Stream completion hook:** `application/single_app/route_backend_chats.py`
+- **Unread-state helpers:** `application/single_app/functions_conversation_unread.py`
+- **Notification helpers:** `application/single_app/functions_notifications.py`
+- **Read/clear endpoint:** `POST /api/conversations/<conversation_id>/mark-read`
+
+When a personal chat stream completes, the backend now:
+- persists the assistant message as before
+- marks the conversation with unread assistant-response fields
+- creates a personal `chat_response_complete` notification
+- stores `conversation_id` and `message_id` in notification metadata
+- uses `/chats?conversationId=...` as the notification deep link
+
+### Frontend
+- **Main list module:** `application/single_app/static/js/chat/chat-conversations.js`
+- **Sidebar list module:** `application/single_app/static/js/chat/chat-sidebar-conversations.js`
+- **Stream finalization:** `application/single_app/static/js/chat/chat-streaming.js`
+- **Styling:** `application/single_app/static/css/chats.css`, `application/single_app/static/css/sidebar.css`
+
+The chat UI now:
+- renders a green unread dot for personal conversations with unread assistant responses
+- clears unread state when a conversation is opened
+- immediately clears the just-created unread state if the user is still watching that conversation when streaming finishes
+
+## Notification Behavior
+
+### Deep-Linking
+Notification clicks use the existing notification navigation flow and point directly to:
+
+`/chats?conversationId=`
+
+`chat-onload.js` already supports this URL shape, so the destination conversation is selected automatically after the chat page loads.
+
+### Approximate Active-View Suppression
+This implementation intentionally does not add heartbeat or presence tracking. Instead:
+- the backend always creates the completion notification for personal chats
+- the active chat page immediately calls the new mark-read endpoint after stream completion
+- this keeps the user-facing result aligned with the active-view scenario without adding presence infrastructure
+
+## Conversation Data Shape
+Personal conversation payloads now normalize these fields:
+- `has_unread_assistant_response`
+- `last_unread_assistant_message_id`
+- `last_unread_assistant_at`
+
+Older conversation documents that do not yet contain these fields are normalized to safe defaults in the conversation list and metadata APIs.
+
+## Files Updated
+- `application/single_app/functions_conversation_unread.py`
+- `application/single_app/functions_notifications.py`
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/route_backend_conversations.py`
+- `application/single_app/static/js/chat/chat-conversations.js`
+- `application/single_app/static/js/chat/chat-sidebar-conversations.js`
+- `application/single_app/static/js/chat/chat-streaming.js`
+- `application/single_app/static/css/chats.css`
+- `application/single_app/static/css/sidebar.css`
+- `application/single_app/config.py`
+- `functional_tests/test_chat_completion_notifications.py`
+
+## Usage Instructions
+- Start a personal chat and send a prompt that takes long enough for streaming to remain active.
+- Navigate away from the chat page before the response completes.
+- After completion, open Notifications and click the new AI response notification.
+- The app navigates back to the exact conversation and clears the unread state.
+
+## Testing and Validation
+- **Functional test:** `functional_tests/test_chat_completion_notifications.py`
+
+The regression test validates:
+- chat-response notification creation and deep-link shape
+- unread-field normalization for older conversation documents
+- mark-read endpoint clearing both conversation unread state and notification read state
+- frontend wiring for unread dots and mark-read calls
+
+## Performance Considerations
+- No polling was added beyond the existing notification badge polling
+- The mark-read endpoint is idempotent and lightweight
+- The unread indicator stores a single latest unread assistant response state, not an unread count
+
+## Known Limitations
+- First rollout is personal chats only
+- Group and public chat conversations do not yet participate in this notification flow
+- Presence detection is approximate rather than heartbeat-based
+- The green dot indicates unread assistant completion state, not a count of unread assistant messages
\ No newline at end of file
diff --git a/docs/explanation/features/CONVERSATION_EXPORT.md b/docs/explanation/features/CONVERSATION_EXPORT.md
index c56d261a..a674dca4 100644
--- a/docs/explanation/features/CONVERSATION_EXPORT.md
+++ b/docs/explanation/features/CONVERSATION_EXPORT.md
@@ -1,139 +1,186 @@
# Conversation Export
## Overview
-The Conversation Export feature allows users to export one or multiple conversations directly from the Chats experience. A multi-step wizard modal guides users through format selection, output packaging, and downloading the final file.
+The Conversation Export feature lets users export one or more chats from the Chats experience as JSON, Markdown, or PDF. The export now mirrors the live conversation view more closely by excluding deleted and inactive-thread messages, including processing thoughts, and preserving the modern citation buckets used by the chat UI.
-**Version Implemented:** 0.237.050
+**Version Implemented:** 0.239.022 (base), 0.239.023 (PDF export), 0.239.030 (persistent summaries)
## Dependencies
-- Flask (backend route)
-- Azure Cosmos DB (conversation and message storage)
-- Bootstrap 5 (modal, step indicators, cards)
-- ES modules (chat-export.js)
+- Flask backend route and file responses
+- Azure Cosmos DB conversation, message, and thought containers
+- Bootstrap 5 modal workflow
+- ES modules in `static/js/chat/chat-export.js`
+- Azure OpenAI / APIM-backed chat model access for optional intro summaries
+- PyMuPDF (fitz) for HTML-to-PDF rendering via the Story API
+
+## Implemented in version: **0.239.030**
## Architecture Overview
### Backend
-- **Route file:** `route_backend_conversation_export.py`
+- **Route file:** `application/single_app/route_backend_conversation_export.py`
- **Endpoint:** `POST /api/conversations/export`
-- **Registration:** Called via `register_route_backend_conversation_export(app)` in `app.py`
+- **Registration:** `register_route_backend_conversation_export(app)`
+
+The export route now:
+- verifies the current user owns each requested conversation
+- loads messages ordered by timestamp, then reapplies thread ordering to match the chat UI
+- removes soft-deleted messages and inactive-thread retries
+- joins processing thoughts from the thoughts container by `message_id`
+- builds both normalized citation summaries and raw citation buckets
+- optionally generates a per-conversation intro summary using the selected chat model
+
+### Frontend
+- **JS module:** `application/single_app/static/js/chat/chat-export.js`
+- **Modal host:** `application/single_app/templates/chats.html`
+- **Entry point:** `window.chatExport.openExportWizard(conversationIds, skipSelection)`
+
+The wizard now has up to 5 steps:
+1. **Selection** — Review selected conversations (multi-select exports only)
+2. **Format** — Choose JSON, Markdown, or PDF
+3. **Packaging** — Choose a single file or ZIP archive
+4. **Summary** — Optionally generate a per-conversation intro summary and choose the summary model from the same model list used in the chat composer
+5. **Download** — Review settings and download the export
+
+## Request Payload
-The endpoint accepts a JSON body with:
| Field | Type | Description |
|---|---|---|
-| `conversation_ids` | list[str] | IDs of conversations to export |
-| `format` | string | `"json"` or `"markdown"` |
+| `conversation_ids` | list[str] | Conversation IDs to export |
+| `format` | string | `"json"`, `"markdown"`, or `"pdf"` |
| `packaging` | string | `"single"` or `"zip"` |
-
-The server verifies user ownership of each conversation, fetches messages from Cosmos DB, filters for active thread messages, sanitizes internal fields, and returns either a single file or ZIP archive as a binary download.
-
-### Frontend
-- **JS module:** `static/js/chat/chat-export.js`
-- **Modal HTML:** Embedded in `templates/chats.html` (`#export-wizard-modal`)
-- **Global API:** `window.chatExport.openExportWizard(conversationIds, skipSelection)`
-
-The wizard has up to 4 steps:
-1. **Selection Review** — Shows selected conversations with titles (skipped for single-conversation export)
-2. **Format** — Choose between JSON and Markdown via action-type cards
-3. **Packaging** — Choose between single file and ZIP archive
-4. **Download** — Summary and download button
-
-## Entry Points
-
-### Single Conversation Export
-- **Sidebar ellipsis menu** → "Export" item (in `chat-sidebar-conversations.js`)
-- **Left-pane ellipsis menu** → "Export" item (in `chat-conversations.js`)
-- Both call `window.chatExport.openExportWizard([conversationId], true)` — skips the selection step
-
-### Multi-Conversation Export
-- Enter selection mode by clicking "Select" on any conversation
-- Select multiple conversations via checkboxes
-- Click the export button in:
- - **Left-pane header** — `#export-selected-btn` (btn-info, download icon)
- - **Sidebar actions bar** — `#sidebar-export-selected-btn`
-- These call `window.chatExport.openExportWizard(selectedIds, false)` — shows all 4 steps
+| `include_summary_intro` | boolean | Whether to generate a per-conversation intro summary |
+| `summary_model_deployment` | string | Optional selected model deployment for the summary intro |
## Export Formats
### JSON
-Produces a JSON array where each entry contains:
-```json
-{
- "conversation": {
- "id": "...",
- "title": "...",
- "last_updated": "...",
- "chat_type": "...",
- "tags": [],
- "is_pinned": false,
- "context": []
- },
- "messages": [
- {
- "role": "user",
- "content": "...",
- "timestamp": "...",
- "citations": []
- }
- ]
-}
-```
+Each exported conversation entry contains:
+- `conversation` metadata and aggregate counts
+- `summary_intro` status and content (when enabled)
+- `messages` with:
+ - raw `content`
+ - normalized `content_text`
+ - curated `details`
+ - normalized `citations`
+ - raw `legacy_citations`, `hybrid_citations`, `web_search_citations`, and `agent_citations`
+ - nested `thoughts`
+
+This makes the JSON export useful both for readable inspection and downstream processing.
### Markdown
-Produces a Markdown document with:
-- `# Title` heading
-- Metadata block (last updated, chat type, tags, message count)
-- `### Role` sections per message with timestamps
-- Citation lists where applicable
-- `---` separators between messages and conversations
+Markdown exports are now organized like a lightweight report:
+- conversation title and metadata header
+- optional **Abstract** section generated by the selected model
+- **Transcript** section with clean user/assistant turns only
+- appendix sections for:
+ - conversation metadata
+ - curated message details
+ - references/citations
+ - processing thoughts
+ - supplemental non-transcript messages such as file or system records
+
+This keeps the main body readable while moving verbose reference material to the end of the document.
+
+### PDF
+PDF exports render the conversation as a print-ready document with chat-bubble styling that visually resembles the live chat interface:
+- **User messages** are displayed in blue bubbles (#c8e0fa) aligned to the right
+- **Assistant messages** are displayed in gray bubbles (#f1f0f0) aligned to the left
+- **System and file messages** use distinct background colors
+- Message content is converted from Markdown to HTML for rich formatting
+- The same appendix structure as Markdown is included (metadata, details, references, thoughts, supplemental messages)
+- PDF rendering uses PyMuPDF's Story API on US Letter paper (612 × 792pt) with 0.5-inch margins
+- Multi-page content is handled automatically with page overflow
+
+## Citation Handling
+
+The export supports the same citation categories used in the live chat UI:
+- **Document citations** from `hybrid_citations`
+- **Web citations** from `web_search_citations`
+- **Agent/tool citations** from `agent_citations`
+- **Legacy citations** for backwards-compatible older messages
+
+Normalized citation summaries provide a single export-friendly list, while the raw buckets preserve the original structured data.
+
+## Thoughts Handling
+
+If processing thoughts are enabled and available, they are exported with the assistant message they belong to. Each thought includes:
+- `step_index`
+- `step_type`
+- `content`
+- `detail`
+- `duration_ms`
+- `timestamp`
+
+Markdown exports place these in the **Processing Thoughts** appendix.
## Output Packaging
### Single File
-- One file containing all selected conversations
-- JSON: `.json` file
-- Markdown: `.md` file with `---` separators between conversations
+- JSON exports combine all selected conversations into one `.json` file
+- Markdown exports combine all selected conversations into one `.md` file separated by `---`
+- PDF exports combine all selected conversations into one `.pdf` file with visual separators between conversations
### ZIP Archive
-- One file per conversation inside a `.zip`
-- Filenames: `{sanitized_title}_{id_prefix}.{ext}`
-- Titles are sanitized for filesystem safety (special chars replaced, truncated to 50 chars)
-
-## File Structure
-```
-application/single_app/
-├── route_backend_conversation_export.py # Backend API endpoint
-├── app.py # Route registration
-├── static/js/chat/
-│ ├── chat-export.js # Export wizard module
-│ ├── chat-conversations.js # Left-pane wiring
-│ └── chat-sidebar-conversations.js # Sidebar wiring
-├── templates/
-│ ├── chats.html # Modal HTML + button + script
-│ ├── _sidebar_nav.html # Sidebar export button
-│ └── _sidebar_short_nav.html # Short sidebar export button
-functional_tests/
-└── test_conversation_export.py # Functional tests
-```
-
-## Security
-- Endpoint requires `@login_required` and `@user_required` decorators
-- Each conversation is verified for user ownership before export
-- Internal Cosmos DB fields (`_rid`, `_self`, `_etag`, `user_id`, etc.) are stripped from output
-- No sensitive data is included in the export
+- Each conversation is written to its own file in the archive
+- Filenames use `{sanitized_title}_{conversation_id_prefix}.{ext}`
+- Summary intros remain per conversation in both JSON and Markdown archive entries
+
+## Persistent Conversation Summaries (v0.239.030)
+
+Summaries generated during export or on demand are now persisted to the conversation document in Cosmos DB and reused automatically.
+
+### How Caching Works
+- Each summary stores `message_time_start` and `message_time_end` from the messages it was built from.
+- On subsequent exports, `_build_summary_intro` compares the cached `message_time_end` against the latest message timestamp. If no new messages exist, the cached summary is returned instantly.
+- If new messages exist beyond the cached range, a fresh summary is generated, saved, and returned.
+
+### On-Demand Generation
+- The conversation details modal (opened from the sidebar) now includes a **Summary** card.
+- If no summary exists, a **Generate Summary** button with a model selector lets users create one without exporting.
+- If a summary exists, the content, generation date, and model are displayed with a **Regenerate** button.
+- New API endpoint: `POST /api/conversations/<conversation_id>/summary` with optional `{ "model_deployment": "..." }` body.
+
+### Reusable Helper
+- `generate_conversation_summary()` (in `route_backend_conversation_export.py`) is the shared LLM call function used by both the export pipeline and the API endpoint.
+- It handles transcript assembly, truncation, model role selection (developer vs system), and Cosmos persistence.
+
+## Security and Data-Shaping Rules
+- Route uses authenticated user checks before export
+- Only user-owned personal conversations are exported
+- Internal Cosmos metadata is not passed through
+- Deleted messages and inactive-thread retries are excluded
+- Exported message details are curated rather than raw metadata dumps
+- Raw settings are not exposed to the browser; the export wizard reuses the existing chat model selector already rendered from sanitized settings
+
+## Files Updated
+- `application/single_app/route_backend_conversation_export.py`
+- `application/single_app/route_backend_conversations.py`
+- `application/single_app/static/js/chat/chat-export.js`
+- `application/single_app/static/js/chat/chat-conversation-details.js`
+- `application/single_app/config.py`
+- `functional_tests/test_conversation_export.py`
+- `functional_tests/test_persistent_conversation_summary.py`
## Testing and Validation
-- **Functional test:** `functional_tests/test_conversation_export.py`
-- Tests cover:
- - Conversation sanitization (internal field stripping)
- - Message sanitization
- - Markdown generation (headings, metadata, citations)
- - JSON structure validation
- - ZIP packaging (correct entries, valid content)
- - Filename sanitization (special chars, truncation, empty input)
- - Active thread message filtering
+- **Functional tests:**
+ - `functional_tests/test_conversation_export.py` — export pipeline (8 tests)
+ - `functional_tests/test_persistent_conversation_summary.py` — summary caching (8 tests)
+
+Coverage includes:
+- deleted/inactive message filtering
+- normalized and raw citation export
+- thoughts attached to assistant messages
+- transcript-style Markdown appendices
+- summary-intro metadata shape
+- filename sanitization and ZIP naming
+- content normalization and citation-count helpers
+- PDF HTML body generation with chat-bubble classes
+- PDF byte output validation (%PDF- header)
## Known Limitations
-- Export is limited to conversations the authenticated user owns
-- Very large conversations (thousands of messages) may take longer to process
-- The wizard fetches conversation titles client-side; if a title lookup fails, it shows the conversation ID instead
+- Export remains limited to user-owned personal conversations
+- Optional intro summaries increase export time because each selected conversation is summarized individually
+- Very large conversations may be truncated for summary generation, though the full export content is still preserved in the output
+- Summary generation is best-effort: if model initialization or generation fails, the export still completes and records the summary error state
+- PDF rendering relies on PyMuPDF’s CSS subset which does not support flexbox, float, or full border-radius; bubble alignment is approximated using margin offsets
diff --git a/docs/explanation/features/MESSAGE_EXPORT.md b/docs/explanation/features/MESSAGE_EXPORT.md
new file mode 100644
index 00000000..4495b5a1
--- /dev/null
+++ b/docs/explanation/features/MESSAGE_EXPORT.md
@@ -0,0 +1,118 @@
+# Per-Message Export
+
+## Overview
+The Per-Message Export feature adds export and action options directly to the three-dots dropdown menu on individual chat messages. Users can export a single message to Markdown or Word, insert it into the chat prompt, or open it in their default email client — all without leaving the chat interface.
+
+**Version Implemented:** 0.239.005–0.239.007
+
+## Dependencies
+- Flask (backend route for Word export)
+- `python-docx` 1.1.2 (Word document generation)
+- Azure Cosmos DB (message retrieval for Word export)
+- Bootstrap 5 (dropdown menus, icons)
+- ES modules (`chat-message-export.js`)
+
+## Architecture Overview
+
+### Backend
+- **Route file:** `route_backend_conversation_export.py`
+- **Endpoint:** `POST /api/message/export-word`
+- **Registration:** Registered alongside the existing conversation export routes
+
+The Word export endpoint accepts a JSON body with:
+
+| Field | Type | Description |
+|---|---|---|
+| `message_id` | string | ID of the message to export |
+| `conversation_id` | string | ID of the conversation the message belongs to |
+
+The server verifies user ownership of the conversation, fetches the specific message from Cosmos DB, generates a Word document using `python-docx` with basic Markdown-to-DOCX conversion (headings, paragraphs, bold, italic, inline code, code blocks, lists, citations), and returns the `.docx` as a binary download.
+
+### Frontend
+- **JS module:** `static/js/chat/chat-message-export.js`
+- **Dropdown integration:** `static/js/chat/chat-messages.js` (AI and user message dropdowns)
+- **Dynamic import:** Module is loaded on-demand when any export action is clicked (same pattern as `chat-edit.js`)
+
+## Features
+
+### Export to Markdown
+- **Location:** Three-dots dropdown → "Export to Markdown"
+- **Icon:** `bi-markdown`
+- **Behavior:** Entirely client-side. Grabs the message content from the existing hidden textarea (AI messages) or `.message-text` element (user messages), wraps it with a role header, and triggers a `.md` file download via Blob URL.
+- **Filename pattern:** `message_export_YYYYMMDD_HHMMSS.md`
+
+### Export to Word
+- **Location:** Three-dots dropdown → "Export to Word"
+- **Icon:** `bi-file-earmark-word`
+- **Behavior:** POSTs to `/api/message/export-word`. The backend generates a styled `.docx` document with:
+ - Title heading ("Message Export")
+ - Role metadata
+ - Message content with Markdown formatting preserved (headings, bold, italic, code blocks, lists)
+ - Citations section (if present on the message)
+- **Filename pattern:** `message_export_YYYYMMDD_HHMMSS.docx`
+
+### Use as Prompt
+- **Location:** Three-dots dropdown → "Use as Prompt"
+- **Icon:** `bi-clipboard-plus`
+- **Behavior:** Entirely client-side. Inserts the raw message content directly into the chat input box (`#user-input`), focuses the input, and triggers the auto-resize/send-button update. The user can review, edit, and send it.
+
+### Open in Email
+- **Location:** Three-dots dropdown → "Open in Email"
+- **Icon:** `bi-envelope`
+- **Behavior:** Entirely client-side. Opens the user's default email client via a `mailto:` link with:
+ - **Subject:** "Chat message from [sender name]"
+ - **Body:** The full message content
+- Uses `encodeURIComponent` for safe URL encoding of the content.
+
+## Dropdown Menu Structure
+
+Both AI and user messages now have export options below a divider:
+
+**AI Message Dropdown:**
+1. Delete
+2. Retry
+3. Feedback (thumbs up/down)
+4. ─── divider ───
+5. Export to Markdown
+6. Export to Word
+7. Use as Prompt
+8. Open in Email
+
+**User Message Dropdown:**
+1. Edit
+2. Delete
+3. Retry
+4. ─── divider ───
+5. Export to Markdown
+6. Export to Word
+7. Use as Prompt
+8. Open in Email
+
+## File Structure
+
+| File | Purpose |
+|------|---------|
+| `static/js/chat/chat-message-export.js` | Client-side export functions (Markdown, Word fetch, Use as Prompt, Open in Email) |
+| `static/js/chat/chat-messages.js` | Dropdown menu HTML and event bindings for both AI and user messages |
+| `route_backend_conversation_export.py` | Backend `/api/message/export-word` endpoint and Markdown-to-DOCX conversion |
+
+## Testing and Validation
+
+### Test Scenarios
+1. AI message → Export to Markdown → `.md` file downloads with content and role header
+2. AI message → Export to Word → `.docx` file downloads with formatted content and citations
+3. User message → Export to Markdown → `.md` file downloads
+4. User message → Export to Word → `.docx` file downloads
+5. AI/User message → Use as Prompt → Content appears in chat input box
+6. AI/User message → Open in Email → Default email client opens with pre-filled subject and body
+7. Existing actions (Delete, Retry, Edit, Feedback) still function correctly
+
+### Known Limitations
+- Word export requires a round-trip to the backend; offline use is not supported for Word format
+- `mailto:` URL length is limited by the email client/OS; very long messages may be truncated
+- Markdown export for user messages uses `innerText` rather than original markdown source
+
+## Cross-References
+- Related feature: [Conversation Export](CONVERSATION_EXPORT.md) — exports entire conversations
+- Backend shares infrastructure with conversation export (`route_backend_conversation_export.py`)
+- Functional tests: `functional_tests/test_message_export.py` (when created)
diff --git a/docs/explanation/features/SIMPLECHAT_STARTUP.md b/docs/explanation/features/SIMPLECHAT_STARTUP.md
new file mode 100644
index 00000000..d5a35d23
--- /dev/null
+++ b/docs/explanation/features/SIMPLECHAT_STARTUP.md
@@ -0,0 +1,191 @@
+# SimpleChat Startup and Scheduler (v0.239.136)
+
+## Overview
+This document explains how SimpleChat should be started in local development, Azure App Service native Python deployments, and container-based runtimes. It also explains how background scheduler work is separated from the Gunicorn web process so administrators can use more than one web worker without duplicating scheduler threads.
+
+**Version Implemented:** 0.239.136
+
+## Dependencies
+- Flask application bootstrap in `application/single_app/app.py`
+- Gunicorn runtime config in `application/single_app/gunicorn.conf.py`
+- Shared scheduler loops in `application/single_app/background_tasks.py`
+- Dedicated scheduler entrypoint in `application/single_app/simplechat_scheduler.py`
+- Container startup in `application/single_app/Dockerfile`
+
+## Implemented in version: **0.239.136**
+
+## Technical Specifications
+
+### Web Process Modes
+- **Local debug mode:** `FLASK_DEBUG=1` and `python app.py`
+- **Direct Gunicorn mode:** Gunicorn launched by App Service or by an operator command
+- **Optional handoff mode:** `python app.py` with `SIMPLECHAT_USE_GUNICORN=1` on Linux-compatible runtimes
+
+The web process now supports two production-safe approaches:
+
+1. Launch Gunicorn directly.
+2. Launch `python app.py` and let the process exec into Gunicorn when `SIMPLECHAT_USE_GUNICORN=1` is set.
+
+If Gunicorn is already the startup command, `SIMPLECHAT_USE_GUNICORN` is not needed.
+
+Important platform note:
+
+- Windows local development should stay on `FLASK_DEBUG=1` with `python app.py`.
+- Gunicorn and the optional handoff path should be treated as Linux/container/App Service runtime options, not native Windows runtime options.
+- If a Windows developer needs Gunicorn-specific worker or thread validation, use Docker Desktop, WSL2, or another Linux environment.
+
+### Background Scheduler Separation
+Scheduler-style loops are defined in `background_tasks.py` and can be started either:
+
+- inside a single-process web runtime for local development or legacy deployments
+- in a separate dedicated scheduler process by running `simplechat_scheduler.py`
+
+Background loops now start unless `SIMPLECHAT_RUN_BACKGROUND_TASKS` is explicitly set to a false-like value such as `0`, `false`, `no`, or `off`.
+
+Approval expiry and retention policy execution also use Cosmos-backed distributed lease documents in the shared settings container, ensuring that only one worker or instance performs those jobs at a time.
+
+### Environment Variables
+
+#### Web Process
+- `FLASK_DEBUG=1`
+ Uses the Flask development server with HTTPS and local-friendly behavior.
+- `SIMPLECHAT_USE_GUNICORN=1`
+ Only matters when the process starts as `python app.py` in non-debug mode on a runtime that can execute Gunicorn.
+- `SIMPLECHAT_RUN_BACKGROUND_TASKS`
+ Background loops are enabled when this setting is unset. Set it to `0`, `false`, `no`, or `off` to disable background loops in the current process.
+
+#### Gunicorn Tuning
+- `GUNICORN_BIND`
+- `GUNICORN_WORKERS`
+- `GUNICORN_THREADS`
+- `GUNICORN_TIMEOUT`
+- `GUNICORN_GRACEFUL_TIMEOUT`
+- `GUNICORN_KEEPALIVE`
+- `GUNICORN_MAX_REQUESTS`
+- `GUNICORN_MAX_REQUESTS_JITTER`
+
+## Native Python App Service
+
+### Recommended Web Startup
+Use Gunicorn directly in the App Service Startup command when deploying the native Python runtime.
+
+Deploy and run the `application/single_app` folder in App Service.
+
+Use this Startup command:
+
+```bash
+python -m gunicorn -c gunicorn.conf.py app:app
+```
+
+An explicit full command is also valid:
+
+```bash
+gunicorn --bind=0.0.0.0:$PORT --worker-class gthread --workers 2 --threads 8 --timeout 900 --graceful-timeout 60 --keep-alive 75 --max-requests 500 --max-requests-jitter 50 app:app
+```
+
+### Recommended Scheduler Process
+Run scheduler work in a separate job/process instead of inside the web workers.
+
+Recommended command:
+
+```bash
+python simplechat_scheduler.py
+```
+
+Operational options include:
+- a separate App Service or worker instance dedicated to the scheduler command
+- a WebJob or automation step that runs the scheduler command
+- a scheduled container/job platform that launches the same codebase with the scheduler command
+
+### Admin Guidance
+- Keep Gunicorn as the web Startup command.
+- Leave `SIMPLECHAT_USE_GUNICORN` unset unless you intentionally want `python app.py` to hand off to Gunicorn.
+- Set `SIMPLECHAT_RUN_BACKGROUND_TASKS=0` in multi-worker Gunicorn web deployments if the scheduler runs elsewhere.
+- Use `workers=2` for the web process only after moving scheduler work out to the dedicated scheduler process.
+
+## Container Runtime
+
+### Default Web Container Behavior
+The container image now starts the web process with Gunicorn by default through the Docker entrypoint.
+
+Web container entrypoint:
+
+```text
+python3 -m gunicorn -c /app/gunicorn.conf.py app:app
+```
+
+### Dedicated Scheduler Container or Job
+Use the same image with an overridden command to run scheduler work separately.
+
+Scheduler command:
+
+```bash
+python3 /app/simplechat_scheduler.py
+```
+
+This allows a deployment topology such as:
+- one web container with `workers=2`
+- one scheduler container or job running `simplechat_scheduler.py`
+
+## Local Development
+
+### Default Local Workflow
+For everyday development, use:
+
+```bash
+FLASK_DEBUG=1
+python app.py
+```
+
+This keeps the normal Flask development flow and starts background loops in the local process.
+
+On Windows, this is the recommended local workflow. Keep `FLASK_DEBUG=1` enabled and do not rely on native Gunicorn execution.
+
+If multiple workers or instances are active, the approval expiry and retention policy jobs now coordinate through distributed locks. Logging timer work still runs per process.
+
+### Production-Like Local Workflow
+For concurrency, timeout, and streaming validation, run Gunicorn locally only in Linux-compatible environments such as Docker, WSL2, or a native Linux/macOS shell:
+
+```bash
+gunicorn --bind=0.0.0.0:5000 --worker-class gthread --workers 2 --threads 8 --timeout 900 --graceful-timeout 60 --keep-alive 75 --max-requests 500 --max-requests-jitter 50 app:app
+```
+
+Windows note:
+
+- Native Windows Python cannot run Gunicorn because Gunicorn depends on Unix-only modules.
+- On Windows, use `python app.py` for application development and switch to Docker or WSL2 when you need Gunicorn-specific validation.
+
+To test the scheduler separately at the same time:
+
+```bash
+python simplechat_scheduler.py
+```
+
+## Usage Instructions
+
+### Native Python App Service
+1. Set the App Service Startup command to Gunicorn.
+2. Set `SIMPLECHAT_RUN_BACKGROUND_TASKS=0` in the web app configuration when scheduler work is running in a separate process/job.
+3. Run scheduler work with `python simplechat_scheduler.py` in a separate process/job.
+
+### Container Deployments
+1. Keep the default Gunicorn web entrypoint.
+2. Launch a second container/job using the same image.
+3. Override its command to `python3 /app/simplechat_scheduler.py`.
+
+## Testing and Validation
+- Functional test: `functional_tests/test_gunicorn_startup_support.py`
+- Functional test: `functional_tests/test_startup_scheduler_support.py`
+
+These tests verify:
+- Gunicorn-aware startup helpers and config defaults
+- shared background task module extraction
+- dedicated scheduler entrypoint presence
+- deployment guidance documentation presence
+
+## Known Limitations
+- Native Windows Python is not a supported Gunicorn runtime.
+- Leaving `SIMPLECHAT_RUN_BACKGROUND_TASKS` unset enables the loops in every Gunicorn worker process.
+- Approval expiry and retention policy now coordinate with distributed locks, but logging timer work still runs in every enabled process.
+- Set `SIMPLECHAT_RUN_BACKGROUND_TASKS=0` in web workers if you want the separate scheduler process to be the only scheduler runtime.
+- The scheduler separation prepares the app for multi-worker web runtimes, but the actual Azure job/container orchestration still needs to be configured per environment.
\ No newline at end of file
diff --git a/docs/explanation/features/v0.229.001/OPENAPI_ACTION.md b/docs/explanation/features/v0.229.001/OPENAPI_ACTION.md
index d1b765cc..9a6b999b 100644
--- a/docs/explanation/features/v0.229.001/OPENAPI_ACTION.md
+++ b/docs/explanation/features/v0.229.001/OPENAPI_ACTION.md
@@ -12,7 +12,6 @@ This OpenAPI Semantic Kernel Action/Plugin allows you to expose any OpenAPI-comp
- **Better error handling**: Validates inputs and provides clear error messages
- **Multiple file formats**: Supports both YAML and JSON OpenAPI specifications
- **Secure file uploads**: Comprehensive security validation for uploaded OpenAPI specs
-- **Multiple source types**: Supports file upload and URL download
- **Web UI integration**: Full modal interface for configuration through the web application
## Installation
@@ -21,7 +20,6 @@ No additional dependencies beyond what's already in the project. The plugin uses
- `yaml` for YAML parsing
- `json` for JSON parsing
- `semantic_kernel` for plugin functionality
-- `requests` for URL-based spec downloads
- `re` for security pattern matching
## Configuration Methods
@@ -31,7 +29,6 @@ The OpenAPI plugin can be configured in three ways:
### 1. Through Web UI (Recommended)
Use the plugin configuration modal in the web application to:
- Upload OpenAPI specification files (with security validation)
-- Download specs from URLs (with security checks)
- Configure authentication settings
- Test and validate configurations before saving
@@ -49,8 +46,12 @@ from openapi_plugin_factory import OpenApiPluginFactory
# Create from configuration
plugin = OpenApiPluginFactory.create_from_config({
- 'openapi_source_type': 'file', # or 'url'
- 'openapi_file_id': 'uploaded_file_id',
+ 'openapi_spec_content': {
+ 'openapi': '3.0.0',
+ 'info': {'title': 'My API', 'version': '1.0.0'},
+ 'paths': {}
+ },
+ 'openapi_source_type': 'content',
'base_url': 'https://api.example.com',
'auth': {'type': 'bearer', 'token': 'your-token'}
})
@@ -162,14 +163,6 @@ The OpenAPI plugin includes comprehensive security validation to prevent malicio
- Code execution patterns (`eval()`, `exec()`, `import os`, etc.)
- SQL injection attempts (`DROP TABLE`, `UNION SELECT`, etc.)
-### URL Security
-- **HTTPS enforcement**: Recommends secure connections
-- **Private network blocking**: Prevents access to:
- - Localhost (`127.0.0.1`, `::1`)
- - Private IP ranges (`10.x.x.x`, `192.168.x.x`, `172.16-31.x.x`)
- - Link-local addresses (`169.254.x.x`)
-- **Content validation**: Downloaded content undergoes same security checks as uploads
-
### Structure Validation
- **Nesting depth limits**: Prevents DoS attacks via deeply nested objects
- **OpenAPI format validation**: Ensures valid OpenAPI 2.0/3.0 structure
@@ -209,9 +202,7 @@ The OpenAPI plugin includes comprehensive security validation to prevent malicio
### Web UI Configuration (Recommended)
1. Open the plugin configuration modal
2. Select "OpenAPI" as plugin type
-3. Choose your specification source:
- - **Upload File**: Drag & drop or select your OpenAPI file
- - **From URL**: Enter URL to your OpenAPI specification
+3. Upload your OpenAPI specification file
4. Configure authentication settings
5. Test and save configuration
@@ -221,8 +212,12 @@ The OpenAPI plugin includes comprehensive security validation to prevent malicio
from openapi_plugin_factory import OpenApiPluginFactory
config = {
- 'openapi_source_type': 'file',
- 'openapi_file_id': 'abc123', # From upload endpoint
+ 'openapi_source_type': 'content',
+ 'openapi_spec_content': {
+ 'openapi': '3.0.0',
+ 'info': {'title': 'My API', 'version': '1.0.0'},
+ 'paths': {}
+ },
'base_url': 'https://api.myservice.com',
'auth': {'type': 'bearer', 'token': 'your-token'}
}
@@ -257,7 +252,6 @@ The plugin includes comprehensive error handling:
- **ValueError**: If required parameters are missing or file format is invalid
- **YAMLError/JSONDecodeError**: If spec file is malformed
- **SecurityValidationError**: If uploaded file contains malicious content
-- **URLSecurityError**: If URL points to forbidden locations (localhost, private networks)
- **FileSizeError**: If uploaded file exceeds 5MB limit
## API Endpoints
@@ -272,36 +266,11 @@ Content-Type: multipart/form-data
Response: {
"success": true,
"file_id": "abc123",
- "api_info": {
+ "spec_info": {
"title": "My API",
- "version": "1.0.0",
- "endpoints_count": 25
- }
-}
-```
-
-### Validate OpenAPI URL
-```
-POST /api/openapi/validate-url
-Content-Type: application/json
-Body: {"url": "https://api.example.com/openapi.yaml"}
-
-Response: {
- "success": true,
- "file_id": "def456",
- "api_info": {...}
-}
-```
-
-### Download from URL
-```
-POST /api/openapi/download-from-url
-Content-Type: application/json
-Body: {"url": "https://api.example.com/openapi.yaml"}
-
-Response: {
- "success": true,
- "file_id": "ghi789"
+ "version": "1.0.0"
+ },
+ "spec_content": {...}
}
```
diff --git a/docs/explanation/features/v0.239.003/PROCESSING_THOUGHTS.md b/docs/explanation/features/v0.239.003/PROCESSING_THOUGHTS.md
new file mode 100644
index 00000000..5dad56d9
--- /dev/null
+++ b/docs/explanation/features/v0.239.003/PROCESSING_THOUGHTS.md
@@ -0,0 +1,153 @@
+---
+layout: libdoc/page
+title: Processing Thoughts
+order: 100
+category: Features
+---
+
+# Processing Thoughts
+
+## Overview
+The Processing Thoughts feature replaces the generic "AI is typing..." indicator with real-time processing step traces that show users what the system is doing during chat processing. Each step (document search, web search, agent invocation, content safety check, response generation) is persisted in Cosmos DB and can be reviewed later via a per-message collapsible section.
+
+**Version Implemented:** 0.239.003
+
+## Dependencies
+- Flask (backend routes)
+- Azure Cosmos DB (`thoughts` and `archive_thoughts` containers)
+- Bootstrap 5 (collapsible section, badges, icons)
+- ES modules (`chat-thoughts.js`)
+
+## Architecture Overview
+
+### Backend
+
+#### ThoughtTracker (`functions_thoughts.py`)
+Stateful per-request tracker that writes each thought step to Cosmos DB immediately so polling clients can see partial progress.
+
+```
+ThoughtTracker(conversation_id, message_id, thread_id, user_id)
+ .add_thought(step_type, content, detail=None) → thought_id
+ .complete_thought(thought_id, duration_ms) → updates duration
+ .enabled → checks settings['enable_thoughts']
+```
+
+Design rules:
+- Each `add_thought()` does an immediate `upsert_item()` to Cosmos DB.
+- All writes are wrapped in try/except — thought errors never crash the chat flow.
+- Auto-increments `step_index` per tracker instance.
+- Logs failures via `log_event()` at WARNING level.
+
+#### Thought Document Schema
+```json
+{
+ "id": "uuid",
+ "conversation_id": "str",
+ "message_id": "str (assistant message ID)",
+ "thread_id": "str",
+ "user_id": "str (partition key)",
+ "step_index": 0,
+ "step_type": "search | tabular_analysis | web_search | agent_tool_call | generation | content_safety",
+ "content": "Searching personal workspace documents for 'sales analysis'...",
+ "detail": "Optional technical detail",
+ "duration_ms": null,
+ "timestamp": "ISO-8601"
+}
+```
+
+#### API Endpoints (`route_backend_thoughts.py`)
+
+| Method | Endpoint | Purpose |
+|--------|----------|---------|
+| GET | `/api/conversations/<conversation_id>/messages/<message_id>/thoughts` | Fetch persisted thoughts for a specific assistant message (historical viewing) |
+| GET | `/api/conversations/<conversation_id>/thoughts/pending` | Fetch latest in-progress thoughts (polling while waiting for response) |
+
+Both endpoints return `{"thoughts": [...], "enabled": true/false}`. When `enable_thoughts` is off, they return `{"thoughts": [], "enabled": false}`.
+
+#### Instrumentation Points (`route_backend_chats.py`)
+
+| Step Type | Content Example | When |
+|-----------|----------------|------|
+| `content_safety` | "Checking content safety..." | Before content safety check |
+| `search` | "Searching personal documents for 'query'..." | Before hybrid search |
+| `search` | "Found 5 results from 3 documents" | After search results |
+| `tabular_analysis` | "Found tabular data — evaluating analysis..." | When tabular data detected |
+| `web_search` | "Searching the web for 'query'..." | Before web search |
+| `web_search` | "Got 8 web results" | After web search |
+| `agent_tool_call` | "Sending to agent 'Data Analyst'..." | Before agent invocation |
+| `generation` | "Generating response..." | Before GPT call |
+
+### Frontend
+
+#### Streaming Mode
+Thought events are embedded in the SSE stream as `{"type": "thought", ...}` JSON payloads. The streaming handler in `chat-streaming.js` passes these to `handleStreamingThought()` which updates the streaming placeholder badge.
+
+#### Non-Streaming Mode
+A polling mechanism in `chat-thoughts.js` fetches `/thoughts/pending` every 2 seconds while waiting for a response. The loading indicator text is updated with the latest thought step.
+
+#### Per-Message History
+Each assistant message footer includes a lightbulb toggle button (when thoughts exist). Clicking it opens a collapsible section that lazy-loads thoughts from the API. Each step shows an icon, content text, and optional duration.
+
+#### Icon Map
+| Step Type | Bootstrap Icon |
+|-----------|---------------|
+| `search` | `bi-search` |
+| `tabular_analysis` | `bi-table` |
+| `web_search` | `bi-globe` |
+| `agent_tool_call` | `bi-robot` |
+| `generation` | `bi-lightning` |
+| `content_safety` | `bi-shield-check` |
+
+## Configuration
+
+### Admin Settings
+- **Toggle**: `enable_thoughts` (default: `false`)
+- **Location**: Admin Settings > Optional Features tab > "Processing Thoughts" section
+- **Effect**: When disabled, no thoughts are recorded and no UI elements are shown
+
+### Cosmos DB Containers
+| Container | Partition Key | Purpose |
+|-----------|--------------|---------|
+| `thoughts` | `/user_id` | Active thought records |
+| `archive_thoughts` | `/user_id` | Archived thoughts from deleted conversations |
+
+## Archive and Cleanup
+
+When a conversation is deleted:
+- **Archiving enabled**: Thoughts are copied to `archive_thoughts` container, then deleted from `thoughts`
+- **Archiving disabled**: Thoughts are permanently deleted from `thoughts`
+
+This applies to both single conversation delete and bulk delete operations.
+
+## File Structure
+
+### Files Created
+| File | Purpose |
+|------|---------|
+| `functions_thoughts.py` | ThoughtTracker class, Cosmos CRUD helpers |
+| `route_backend_thoughts.py` | API endpoints for fetching thoughts |
+| `static/js/chat/chat-thoughts.js` | Frontend polling, rendering, toggle |
+
+### Files Modified
+| File | Change |
+|------|--------|
+| `config.py` | Added `thoughts` + `archive_thoughts` Cosmos containers, bumped VERSION |
+| `functions_settings.py` | Added `enable_thoughts` default setting |
+| `app.py` | Imported and registered thought routes |
+| `route_backend_chats.py` | Instrumented ~8 thought points per chat path |
+| `route_backend_conversations.py` | Added archive/delete thoughts on conversation delete |
+| `templates/admin_settings.html` | Added Processing Thoughts toggle card |
+| `static/js/admin/admin_settings.js` | Added `enable_thoughts` to settings collection |
+| `static/js/chat/chat-messages.js` | Integrated thoughts toggle in footer, polling start/stop |
+| `static/js/chat/chat-streaming.js` | Handle `type: "thought"` in SSE data |
+| `static/js/chat/chat-loading-indicator.js` | Added `updateLoadingIndicatorText()` for thought display |
+| `static/css/chats.css` | Added thought indicator, toggle, container, and dark mode styles |
+
+## Testing
+
+1. **Enable feature**: Set `enable_thoughts: True` in admin settings
+2. **Non-streaming**: Send a message with document search — verify loading indicator updates with thought steps, lightbulb icon appears after response
+3. **Streaming**: Send a message — verify streaming placeholder shows thought badges, lightbulb available after finalization
+4. **History**: Reload page, open old conversation — click lightbulb to verify lazy-loaded thoughts
+5. **Disabled**: Set `enable_thoughts: False` — verify no thoughts generated, no lightbulb icons
+6. **Archive**: Delete a conversation with archiving enabled — verify thoughts moved to `archive_thoughts`
diff --git a/docs/explanation/features/v0.239.022/CONVERSATION_EXPORT.md b/docs/explanation/features/v0.239.022/CONVERSATION_EXPORT.md
new file mode 100644
index 00000000..edb69c2e
--- /dev/null
+++ b/docs/explanation/features/v0.239.022/CONVERSATION_EXPORT.md
@@ -0,0 +1,58 @@
+# Conversation Export
+
+## Overview
+Snapshot of the Conversation Export feature as implemented in version **0.239.022**.
+
+This version updates export generation so JSON includes modern citation buckets, normalized citation summaries, and processing thoughts, while Markdown becomes a transcript-first report with appendix sections and optional AI-generated intro summaries.
+
+**Version Implemented:** 0.239.022
+**Dependencies:** Flask export route, Azure Cosmos DB conversations/messages/thoughts, Bootstrap modal workflow, chat-export.js, Azure OpenAI/APIM chat models
+
+## Technical Summary
+
+### Backend
+- Filters out deleted messages and inactive-thread retries
+- Reapplies thread-aware ordering to align with the live chat view
+- Includes both normalized and raw citations per message
+- Joins persisted processing thoughts by `message_id`
+- Supports optional per-conversation `summary_intro` generation using a selected model
+
+### Frontend
+- Adds a summary step to the export wizard
+- Lets users enable or disable intro summaries
+- Reuses the existing chat model selector options for summary model choice
+
+## Export Shape
+
+### JSON
+Each conversation entry contains:
+- `conversation`
+- `summary_intro`
+- `messages`
+
+Each message can include:
+- `content`
+- `content_text`
+- `details`
+- `citations`
+- `legacy_citations`
+- `hybrid_citations`
+- `web_search_citations`
+- `agent_citations`
+- `thoughts`
+
+### Markdown
+Markdown exports contain:
+- metadata header
+- optional abstract
+- transcript body
+- appendices for metadata, message details, references, thoughts, and supplemental messages
+
+## Files Updated
+- `application/single_app/route_backend_conversation_export.py`
+- `application/single_app/static/js/chat/chat-export.js`
+- `application/single_app/config.py`
+- `functional_tests/test_conversation_export.py`
+
+## Testing
+Validated by `functional_tests/test_conversation_export.py`.
diff --git a/docs/explanation/features/v0.239.123/CHAT_SEARCHABLE_SELECTORS.md b/docs/explanation/features/v0.239.123/CHAT_SEARCHABLE_SELECTORS.md
new file mode 100644
index 00000000..c4859bb2
--- /dev/null
+++ b/docs/explanation/features/v0.239.123/CHAT_SEARCHABLE_SELECTORS.md
@@ -0,0 +1,64 @@
+# Chat Searchable Selectors
+
+## Overview
+Snapshot of the searchable chat selector update as implemented in version **0.239.123**.
+
+This version adds in-menu search to the chat workspace scope and tag filters, and rebuilds the prompt, model, and agent toolbar selectors as searchable dropdowns while keeping the hidden native selects in place for existing chat integrations.
+
+**Version Implemented:** 0.239.123
+**Dependencies:** chats.html toolbar and workspace filter markup, chats.css dropdown styling, chat-documents.js, chat-prompts.js, chat-model-selector.js, chat-agents.js, chat-searchable-select.js
+
+## Technical Specifications
+
+### Architecture Overview
+- Adds a shared `chat-searchable-select.js` helper for two selector patterns:
+ - searchable single-select dropdowns for prompts, models, and agents
+ - searchable filter overlays for the existing scope, tags, and documents dropdowns
+- Keeps `#prompt-select`, `#model-select`, and `#agent-select` as the canonical state holders so existing chat modules can continue reading native select values and option metadata.
+- Extends the existing search-documents card instead of introducing a second workspace filtering UI.
+
+### Prompt Loading
+- Prompt loading now walks paginated `/api/prompts`, `/api/group_prompts`, and `/api/public_prompts` responses using `page_size=100` until the full prompt set is loaded.
+- This removes the prior first-page cap from the chat prompt picker without changing shared prompt API semantics used elsewhere in the application.
+- Scope filtering still applies after prompt data is loaded, so the searchable list only shows prompt categories that match the current chat workspace scope.
+
+### File Structure
+- `application/single_app/templates/chats.html`
+- `application/single_app/static/css/chats.css`
+- `application/single_app/static/js/chat/chat-searchable-select.js`
+- `application/single_app/static/js/chat/chat-documents.js`
+- `application/single_app/static/js/chat/chat-prompts.js`
+- `application/single_app/static/js/chat/chat-model-selector.js`
+- `application/single_app/static/js/chat/chat-agents.js`
+- `functional_tests/test_chat_searchable_selectors.py`
+- `application/single_app/config.py`
+
+## Usage Instructions
+
+### Scope, Tags, and Documents
+- Open **Workspaces** on the chat page.
+- Use **Search workspaces...** to narrow the scope dropdown.
+- Use **Search tags...** to narrow tag and classification choices.
+- Use **Search documents...** to narrow the document list after scope and tag filtering have been applied.
+
+### Prompts
+- Open **Prompts** on the chat page.
+- Search within the prompt dropdown to find a saved prompt by name.
+- Select a single prompt to mirror the value back into the hidden prompt select used by message submission.
+
+### Models and Agents
+- Use the model dropdown search to quickly locate a deployment in long GPT model lists.
+- When agent mode is enabled, the model control swaps to a searchable agent dropdown with the same search interaction.
+- Agent labels continue to include group/global context when needed to distinguish duplicate display names.
+
+## Testing and Validation
+
+### Functional Coverage
+- `functional_tests/test_chat_searchable_selectors.py`
+- `functional_tests/test_workspace_scope_prompts_fix.py`
+
+### Validation Focus
+- Confirms the chat template contains searchable selector markup for scope, tags, prompts, models, and agents.
+- Confirms the shared helper supports both dropdown filtering and searchable single-select behavior.
+- Confirms prompt loading pages through all available prompt results instead of stopping at the default first page.
+- Confirms the app version bump for the feature.
\ No newline at end of file
diff --git a/docs/explanation/fixes/CHANGED_FILES_GITHUB_ACTION_SUPPLY_CHAIN_FIX.md b/docs/explanation/fixes/CHANGED_FILES_GITHUB_ACTION_SUPPLY_CHAIN_FIX.md
new file mode 100644
index 00000000..3533aa74
--- /dev/null
+++ b/docs/explanation/fixes/CHANGED_FILES_GITHUB_ACTION_SUPPLY_CHAIN_FIX.md
@@ -0,0 +1,48 @@
+# Changed-Files GitHub Action Supply Chain Fix
+
+Fixed/Implemented in version: **0.239.135**
+
+## Issue Description
+
+The repository's release notes workflow used `tj-actions/changed-files@v44`, a tag family affected by the March 2025 supply chain incident involving retroactively modified action tags.
+
+Even though the window of compromise has been closed and current tags were restored, keeping the workflow on the older tag family left the repository behind the patched release identified by the advisory.
+
+## Root Cause Analysis
+
+- `.github/workflows/release-notes-check.yml` depended on `tj-actions/changed-files@v44`.
+- The security advisory identifies `46.0.1` as the patched version for the compromised action.
+
+## Technical Details
+
+### Files Modified
+
+- `.github/workflows/release-notes-check.yml`
+- `application/single_app/config.py`
+- `functional_tests/test_changed_files_action_version.py`
+
+### Code Changes Summary
+
+- Updated the release notes workflow to use `tj-actions/changed-files@v46.0.1`.
+- Added a functional regression test that verifies the patched action reference and rejects the known malicious commit SHA.
+- Bumped the application version to `0.239.135` for traceability.
+
+### Testing Approach
+
+- Added `functional_tests/test_changed_files_action_version.py` to validate the workflow pin, version bump, and fix documentation marker.
+
+## Validation
+
+### Before
+
+- The workflow referenced `tj-actions/changed-files@v44`.
+- There was no repository-level regression check guarding against reintroduction of the malicious commit SHA.
+
+### After
+
+- The workflow references the patched `v46.0.1` release.
+- The regression test asserts that the known malicious SHA is absent and the patched version remains in place.
+
+### Impact Analysis
+
+This is a narrow CI supply-chain remediation. It does not change application runtime behavior, but it does reduce the risk of reintroducing a compromised GitHub Action reference in repository automation.
\ No newline at end of file
diff --git a/docs/explanation/fixes/CHAT_COMPLETION_PERSONAL_SCOPE_GATE_FIX.md b/docs/explanation/fixes/CHAT_COMPLETION_PERSONAL_SCOPE_GATE_FIX.md
new file mode 100644
index 00000000..336d7052
--- /dev/null
+++ b/docs/explanation/fixes/CHAT_COMPLETION_PERSONAL_SCOPE_GATE_FIX.md
@@ -0,0 +1,47 @@
+# Chat Completion Personal Scope Gate Fix
+
+Fixed/Implemented in version: **0.239.133**
+
+## Issue Description
+
+Personal chat responses could complete and save successfully after the user navigated away, but no completion notification appeared and no green unread dot was shown when returning to the chat page.
+
+## Root Cause Analysis
+
+The streaming completion path decided whether to create chat-completion notifications by checking `active_group_id` and `active_public_workspace_id` from the request/session state.
+
+Those workspace identifiers can stay populated even when the actual conversation being completed is still a personal chat. In that case, the route incorrectly skipped the personal unread-state and notification writes.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_chat_completion_notifications.py`
+- `functional_tests/test_chat_stream_background_execution.py`
+- `functional_tests/test_streaming_only_chat_path.py`
+
+### Code Changes Summary
+
+- Added `is_personal_chat_conversation(...)` to classify the completed conversation from its saved `chat_type`.
+- Updated the streaming completion path to use conversation metadata instead of active workspace session values when deciding whether to create personal unread-state and notification side effects.
+- Added debug logging for the non-personal skip path to make future scope mismatches easier to diagnose.
+- Bumped the application version to `0.239.133`.
+
+### Testing Approach
+
+- Extended the chat completion notification regression to verify the streaming completion path uses the personal-conversation helper instead of the old active-workspace gate.
+- Updated the streaming route regressions so their version assertions stay aligned with the current app version.
+
+## Validation
+
+### Before
+
+- Personal chats could complete and persist the assistant answer without receiving completion-side unread state or notifications.
+- A stale active group or public workspace in session could suppress personal notifications incorrectly.
+
+### After
+
+- Personal chat completion side effects are now keyed off the saved conversation type.
+- Personal chats continue to receive unread markers and completion notifications even if unrelated workspace session state is populated.
\ No newline at end of file
diff --git a/docs/explanation/fixes/CHAT_COMPLETION_STREAM_FINALIZATION_FIX.md b/docs/explanation/fixes/CHAT_COMPLETION_STREAM_FINALIZATION_FIX.md
new file mode 100644
index 00000000..45a060fc
--- /dev/null
+++ b/docs/explanation/fixes/CHAT_COMPLETION_STREAM_FINALIZATION_FIX.md
@@ -0,0 +1,44 @@
+# Chat Completion Stream Finalization Fix
+
+Fixed/Implemented in version: **0.239.130**
+
+## Issue Description
+
+Personal chat responses could finish successfully in the background, but the user would not receive a completion notification and the conversation would not show the green unread dot when returning to the chat page.
+
+## Root Cause Analysis
+
+The streaming `/api/chat/stream` completion path persisted the final assistant message and updated conversation metadata, but it did not mark the conversation as unread or create the personal `chat_response_complete` notification before emitting the terminal SSE payload.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_chat_completion_notifications.py`
+
+### Code Changes Summary
+
+- Restored unread-state writes in the streaming completion branch using `mark_conversation_unread(...)`.
+- Restored personal completion-notification creation using `create_chat_response_notification(...)`.
+- Kept the behavior scoped to personal chats only by skipping group and public workspace completions.
+- Added regression coverage that inspects the streaming completion path for the unread-state and notification calls.
+- Bumped the application version to `0.239.130`.
+
+### Testing Approach
+
+- Extended `functional_tests/test_chat_completion_notifications.py` to verify the SSE completion branch includes unread-state and notification creation.
+
+## Validation
+
+### Before
+
+- Background chat completion could persist the assistant message without creating a notification.
+- Returning to the chat page showed no unread dot because the conversation unread fields were never written.
+
+### After
+
+- Personal streaming completions mark the conversation unread before the final SSE payload is sent.
+- Personal streaming completions create a `chat_response_complete` notification with the conversation deep link.
+- Returning to the chats page shows the green unread dot until the conversation is opened or marked read.
\ No newline at end of file
diff --git a/docs/explanation/fixes/CHAT_STREAM_BACKGROUND_BRIDGE_RESTORE_FIX.md b/docs/explanation/fixes/CHAT_STREAM_BACKGROUND_BRIDGE_RESTORE_FIX.md
new file mode 100644
index 00000000..778b9201
--- /dev/null
+++ b/docs/explanation/fixes/CHAT_STREAM_BACKGROUND_BRIDGE_RESTORE_FIX.md
@@ -0,0 +1,45 @@
+# Chat Stream Background Bridge Restore Fix
+
+Fixed/Implemented in version: **0.239.132**
+
+## Issue Description
+
+Leaving the chat page during a streamed response could still cause the assistant answer to disappear entirely from the conversation, with no completion notification and no unread marker when returning later.
+
+## Root Cause Analysis
+
+The active `route_backend_chats.py` route had drifted back to returning `stream_with_context(generate())` directly for both streaming entry points. That made the request-bound SSE generator the owner of assistant generation again, so browser navigation could terminate the stream before final assistant persistence and notification side effects ran.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_chat_completion_notifications.py`
+
+### Code Changes Summary
+
+- Restored a queue-backed `BackgroundStreamBridge` to decouple SSE delivery from the background chat worker.
+- Wrapped the worker with `copy_current_request_context()` so existing request/session-dependent logic remains available during background completion.
+- Routed both the main streaming path and compatibility streaming path through the background bridge helper.
+- Advanced the application version to `0.239.132` to match the active streaming regression suite.
+
+### Testing Approach
+
+- Reused the existing streaming background execution regression to verify the bridge class, executor submission, fallback thread path, and route wiring.
+- Updated the chat completion notification regression to validate the current app version after the restore.
+
+## Validation
+
+### Before
+
+- Chat logs could show intermediate tool work but never reach final assistant persistence.
+- Returning to the conversation could show only the user message because the response died with the detached request.
+- Completion notifications and unread dots were skipped because the finalization block never ran.
+
+### After
+
+- Streaming chat work is executed in the background and only relayed through the HTTP response.
+- Browser disconnects detach the consumer without canceling the background worker.
+- Final assistant persistence, unread state, and completion notifications remain reachable after navigation away from the chat page.
\ No newline at end of file
diff --git a/docs/explanation/fixes/CHAT_STREAM_BACKGROUND_EXECUTION_FIX.md b/docs/explanation/fixes/CHAT_STREAM_BACKGROUND_EXECUTION_FIX.md
new file mode 100644
index 00000000..5a94c022
--- /dev/null
+++ b/docs/explanation/fixes/CHAT_STREAM_BACKGROUND_EXECUTION_FIX.md
@@ -0,0 +1,54 @@
+# Chat Stream Background Execution Fix
+
+Fixed/Implemented in version: **0.239.129**
+
+## Issue Description
+
+Leaving a streaming chat page before the assistant finished could stop the server-side chat execution entirely. In practice, backend logs stopped as soon as the browser disconnected, and the assistant response never reached the final persistence, unread-state, or notification paths.
+
+## Root Cause Analysis
+
+The normal `/api/chat/stream` implementation performed the model call and all downstream persistence directly inside the request-bound SSE generator.
+
+That meant the request-response loop was also the worker. Once the browser navigated away and the streaming response was torn down, the long-running chat work could stop with it.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_chat_stream_background_execution.py`
+- `functional_tests/test_streaming_only_chat_path.py`
+- `functional_tests/test_chat_completion_notifications.py`
+
+### Code Changes Summary
+
+- Added a queue-backed `BackgroundStreamBridge` to decouple SSE delivery from chat execution.
+- Wrapped the streaming worker with `copy_current_request_context()` so existing request/session-dependent chat logic can still run in background execution.
+- Started the streaming worker through Flask-Executor when available, with a daemon-thread fallback.
+- Routed both the normal streaming path and the compatibility streaming path through the same background bridge helper.
+- Bumped the application version to `0.239.129`.
+
+### Testing Approach
+
+- Added `functional_tests/test_chat_stream_background_execution.py` to verify the background bridge, executor submission, and versioned fix documentation.
+- Updated the relevant streaming and chat notification regression tests so their version checks stay aligned with the current app version.
+
+## Validation
+
+### Before
+
+- Normal streaming chat execution lived inside the response generator.
+- Navigating away from the chat page could terminate the in-flight assistant generation.
+- Completion-side effects such as final message persistence and notifications could be skipped.
+
+### After
+
+- Chat generation is started in the background and the HTTP response only relays queued SSE events.
+- If the browser disconnects, the consumer detaches but the background chat worker can continue to completion.
+- Final assistant persistence, unread markers, and completion notifications remain reachable even when the user leaves the page.
+
+### Impact Analysis
+
+This fix keeps the current streaming UX for connected users while removing the request lifecycle as the owner of the chat workload. It is intentionally minimal: the existing streaming generator remains the source of chat behavior, and the new bridge only changes how those events are delivered to the browser.
\ No newline at end of file
diff --git a/docs/explanation/fixes/CHAT_STREAM_COMPATIBILITY_SSE_SYNTAX_FIX.md b/docs/explanation/fixes/CHAT_STREAM_COMPATIBILITY_SSE_SYNTAX_FIX.md
new file mode 100644
index 00000000..752f3de1
--- /dev/null
+++ b/docs/explanation/fixes/CHAT_STREAM_COMPATIBILITY_SSE_SYNTAX_FIX.md
@@ -0,0 +1,54 @@
+# Chat Stream Compatibility SSE Syntax Fix
+
+Fixed/Implemented in version: **0.239.134**
+
+## Issue Description
+
+The compatibility branch inside the streaming chat route emitted image-generation thought events through multi-line f-strings that embedded `json.dumps({...})` directly inside the interpolation expression.
+
+In CI, that block was parsed as an unterminated string literal in `route_backend_chats.py`, which stopped the job before runtime tests could begin.
+
+## Root Cause Analysis
+
+The SSE compatibility bridge assembled dictionary literals inline inside an f-string expression across multiple lines.
+
+That formatting is fragile and parser-hostile in this file, especially in automated validation paths that compile the module directly.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_chat_stream_compatibility_sse_syntax.py`
+- `functional_tests/test_chat_stream_background_execution.py`
+- `functional_tests/test_streaming_only_chat_path.py`
+- `functional_tests/test_chat_completion_notifications.py`
+
+### Code Changes Summary
+
+- Replaced the three multi-line SSE `yield` statements in the compatibility image-generation path with explicit payload dictionaries stored in local variables.
+- Kept the emitted SSE payload shape unchanged while making the code parser-safe.
+- Added a regression test that compiles `route_backend_chats.py` and verifies the new payload-variable pattern.
+- Bumped the application version to `0.239.134`.
+
+### Testing Approach
+
+- Added `functional_tests/test_chat_stream_compatibility_sse_syntax.py` to compile the route module and verify the fixed compatibility SSE block.
+- Updated existing streaming-related functional tests so their version checks align with the current app version.
+
+## Validation
+
+### Before
+
+- The compatibility SSE branch embedded multi-line dictionary literals directly inside f-string interpolation.
+- CI could fail during parsing with `SyntaxError: unterminated string literal` near the first compatibility image-generation event.
+
+### After
+
+- The compatibility SSE branch builds JSON payload dictionaries first and interpolates only the serialized variable into the SSE frame.
+- The route module compiles cleanly and preserves the same thought-event content for image-generation compatibility mode.
+
+### Impact Analysis
+
+This is a narrow, low-risk parser-safety fix. It does not change the compatibility mode contract or the streamed payload content, but it does prevent a syntax-level failure that blocked the chat route from loading.
\ No newline at end of file
diff --git a/docs/explanation/fixes/CHAT_STREAM_DEBUG_LOGGING_FIX.md b/docs/explanation/fixes/CHAT_STREAM_DEBUG_LOGGING_FIX.md
new file mode 100644
index 00000000..efbfe68b
--- /dev/null
+++ b/docs/explanation/fixes/CHAT_STREAM_DEBUG_LOGGING_FIX.md
@@ -0,0 +1,52 @@
+# Chat Stream Debug Logging Fix
+
+Fixed/Implemented in version: **0.239.142**
+
+## Issue Description
+
+Normal chat usage goes through `/api/chat/stream`, but the streaming path did not emit enough unconditional `debug_print()` output to make local troubleshooting practical. Startup logging still worked, while key runtime steps in the streaming request and Semantic Kernel orchestration path were too quiet.
+
+## Root Cause Analysis
+
+The codebase still contained many `debug_print()` statements in `route_backend_chats.py`, but many of them were in the non-streaming `/api/chat` handler or inside narrower conditional branches. The frontend chat UI uses the streaming route by default, so important request entry and plugin orchestration events were not consistently visible in the local console.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_chat_stream_debug_logging.py`
+- `functional_tests/test_chat_stream_compatibility_sse_syntax.py`
+- `functional_tests/test_chat_stream_background_execution.py`
+
+### Code Changes Summary
+
+- Added unconditional streaming-route `debug_print()` output for request entry, compatibility-mode routing, normalized request state, model initialization, conversation load/create, and final stream completion.
+- Added explicit streaming Semantic Kernel orchestration logging for plugin invocation clearing, tabular-analysis entry/exit, response-path selection, plugin callback registration, plugin callback execution, and callback deregistration.
+- Bumped the application version to `0.239.142`.
+- Added a regression test that checks the required streaming debug markers remain present.
+- Updated existing streaming regression tests to use the current application version.
+
+### Testing Approach
+
+- Added `functional_tests/test_chat_stream_debug_logging.py` to verify the new streaming debug markers exist.
+- Re-ran existing streaming regression tests covering SSE syntax and background execution.
+
+## Validation
+
+### Before
+
+- Startup debug output proved `debug_print()` still worked.
+- Regular UI chat requests still appeared quiet in the console.
+- Plugin execution visibility depended on hitting narrower branches instead of the main streaming path.
+
+### After
+
+- The streaming route now logs a request summary as soon as `/api/chat/stream` is entered.
+- The console shows which response path was selected, when plugin callbacks were registered and fired, and when the stream finalized.
+- Stream-focused regression tests pass with the updated instrumentation and version bump.
+
+### Impact Analysis
+
+This change is deliberately narrow: it restores operational visibility for the route the frontend already uses, without changing the streaming contract or response payload shape.
\ No newline at end of file
diff --git a/docs/explanation/fixes/CHAT_WORKSPACE_SELECTION_RESET_FIX.md b/docs/explanation/fixes/CHAT_WORKSPACE_SELECTION_RESET_FIX.md
new file mode 100644
index 00000000..4f2fe8d2
--- /dev/null
+++ b/docs/explanation/fixes/CHAT_WORKSPACE_SELECTION_RESET_FIX.md
@@ -0,0 +1,55 @@
+# Chat Workspace Selection Reset Fix
+
+## Fix Title
+Implicit chat conversation creation now preserves selected workspace scope, tags, and documents.
+
+## Issue Description
+When a user arrived on the chat page with workspace context already selected, either by choosing documents manually or by coming from a workspace link that preselected scope, tags, or documents, clicking into the message input created a conversation and immediately reset the workspace filters back to their defaults.
+
+## Root Cause Analysis
+
+1. The chat bootstrap in `chat-onload.js` auto-created a conversation on first input focus when no conversation ID existed.
+2. `createNewConversation()` in `chat-conversations.js` always called `resetScopeLock()` in full reset mode, which restored the scope dropdown to `All` and reloaded document and tag controls.
+3. That full reset path rebuilt the document and tag UI in `chat-documents.js`, which cleared the preselected workspace context before the user sent the first message.
+
+## Version Implemented
+Fixed in version: **0.239.105**
+
+## Files Modified
+
+| File | Change |
+|------|--------|
+| `application/single_app/static/js/chat/chat-documents.js` | Added a preserve-selection mode to `resetScopeLock()` so lock state can be cleared without rebuilding current scope, tag, and document selections. |
+| `application/single_app/static/js/chat/chat-conversations.js` | Added a `preserveSelections` option to `createNewConversation()` and reused in-flight create requests so implicit creation does not race duplicate conversations. |
+| `application/single_app/static/js/chat/chat-onload.js` | Changed implicit auto-create entry points for input focus, prompt selection, and file selection to preserve current workspace filters. |
+| `application/single_app/static/js/chat/chat-input-actions.js` | Updated file-upload auto-create flows to preserve workspace selections. |
+| `application/single_app/static/js/chat/chat-messages.js` | Updated the first-send auto-create flow to preserve workspace selections. |
+| `functional_tests/test_chat_preserves_workspace_selection_on_auto_create.py` | Added regression coverage for preserve-selection reset logic and implicit conversation creation call sites. |
+| `application/single_app/config.py` | Version bump to `0.239.105`. |
+
+## Code Changes Summary
+
+### Preserve Selection State During Implicit Auto-Create
+- Added an options-based preserve path to the scope reset helper so a new conversation can start unlocked without forcing the workspace picker back to `All`.
+- Updated implicit conversation creation flows to request preserved selections.
+
+### Prevent Duplicate Conversation Creation
+- Reused a single in-flight create-conversation request so focus-triggered creation and an immediate send action do not create duplicate empty conversations.
+
+### Keep Explicit Reset Behavior Intact
+- Left the explicit `New Conversation` button on the default reset path so a deliberate fresh chat still restores default workspace scope.
+
+## Testing Approach
+- Added functional regression coverage in `functional_tests/test_chat_preserves_workspace_selection_on_auto_create.py`.
+- Validates that `resetScopeLock()` supports a preserve-selection path.
+- Validates that `createNewConversation()` preserves selections when requested and reuses a pending create request.
+- Validates that implicit creation call sites in the chat bootstrap, first-send flow, and file upload flow all request preserved selections.
+
+## Impact Analysis
+- Users can now click into the message input and continue with the workspace, tag, and document filters they already chose.
+- Workspace deep links remain stable through the first interaction instead of reverting to the default chat scope.
+- Explicit new chat creation still resets to the default workspace scope, so existing fresh-start behavior remains available.
+
+## Validation
+- Before: first focus in the message box could silently reset workspace-related filters before the user sent a message.
+- After: implicit conversation creation preserves the active workspace context, while explicit new chat creation keeps the full reset behavior.
\ No newline at end of file
diff --git a/docs/explanation/fixes/CONTROL_CENTER_GROUP_MANAGER_REFRESHGROUPS_OVERWRITE_FIX.md b/docs/explanation/fixes/CONTROL_CENTER_GROUP_MANAGER_REFRESHGROUPS_OVERWRITE_FIX.md
new file mode 100644
index 00000000..5b226a9d
--- /dev/null
+++ b/docs/explanation/fixes/CONTROL_CENTER_GROUP_MANAGER_REFRESHGROUPS_OVERWRITE_FIX.md
@@ -0,0 +1,57 @@
+# Control Center GroupManager refreshGroups Overwrite Fix
+
+Fixed in version: **0.239.145**
+
+## Issue Description
+
+The embedded `GroupManager` object in `control_center.html` defined `refreshGroups`
+twice. In JavaScript object literals, the later property overwrites the earlier
+one, which triggered the overwritten-property warning and discarded the version
+that showed a loading placeholder before refreshing the groups list.
+
+## Root Cause
+
+An older backward-compatibility alias for `refreshGroups` remained in the same
+object literal after the richer implementation had been added earlier in the
+file. Because both members used the same property name, the later alias replaced
+the intended implementation.
+
+## Files Modified
+
+- `application/single_app/templates/control_center.html`
+- `application/single_app/config.py`
+- `functional_tests/test_control_center_group_manager_refresh_groups_duplicate_fix.py`
+
+## Code Changes Summary
+
+- Removed the duplicate trailing `refreshGroups` alias from `GroupManager`.
+- Kept the single `refreshGroups` implementation that updates the table with a
+ loading message before delegating to `loadGroups()`.
+- Added a regression test that fails if `refreshGroups` is defined more than
+ once in the control center template.
+- Updated the application version to `0.239.145`.
+
+## Testing Approach
+
+- Added a focused functional test that scans the control center template and
+ asserts `refreshGroups` appears exactly once.
+- The same test also verifies the retained implementation still includes the
+ loading placeholder text.
+
+## Impact Analysis
+
+The fix removes a static-analysis warning, preserves the intended refresh UX,
+and reduces the chance of future regressions caused by duplicated object literal
+members in the control center group management script.
+
+## Validation
+
+Before:
+
+- `GroupManager` contained two `refreshGroups` members.
+- The second definition silently overwrote the first.
+
+After:
+
+- `GroupManager` contains a single `refreshGroups` member.
+- The groups table retains the loading placeholder behavior during refresh.
\ No newline at end of file
diff --git a/docs/explanation/fixes/DOCS_JSON_GEM_SECURITY_FIX.md b/docs/explanation/fixes/DOCS_JSON_GEM_SECURITY_FIX.md
new file mode 100644
index 00000000..ebe30eac
--- /dev/null
+++ b/docs/explanation/fixes/DOCS_JSON_GEM_SECURITY_FIX.md
@@ -0,0 +1,57 @@
+# Docs JSON Gem Security Fix
+
+Fixed/Implemented in version: **0.239.136**
+
+## Issue Description
+
+The docs site bundle resolved the Ruby `json` gem to `2.15.0`, which falls in the
+advisory's affected range for format string injection when parsing untrusted JSON
+with `allow_duplicate_key: false`.
+
+## Root Cause Analysis
+
+The docs Jekyll bundle relied on transitive dependency resolution for `json`
+without an explicit minimum version constraint, so `bundle update` had previously
+locked the site to a vulnerable release.
+
+## Technical Details
+
+### Files Modified
+
+- `docs/Gemfile`
+- `docs/Gemfile.lock`
+- `application/single_app/config.py`
+- `functional_tests/test_docs_json_gem_security_fix.py`
+
+### Code Changes Summary
+
+- Added an explicit `json >= 2.19.2` dependency to the docs bundle.
+- Updated the docs lockfile from `json 2.15.0` to `json 2.19.2`.
+- Added a regression test that verifies the Gemfile floor, resolved lockfile version,
+ and application version bump.
+- Bumped the application version to `0.239.136`.
+
+### Testing Approach
+
+- Ran a targeted `bundle update json` in `docs/` to resolve the patched gem version.
+- Added `functional_tests/test_docs_json_gem_security_fix.py` to verify the fix stays
+ in place.
+
+## Validation
+
+### Before
+
+- `docs/Gemfile.lock` resolved `json (2.15.0)`.
+- The patched version was not enforced directly in `docs/Gemfile`.
+
+### After
+
+- `docs/Gemfile.lock` resolves `json (2.19.2)`.
+- `docs/Gemfile` enforces a patched minimum so future dependency refreshes do not
+ drift back into the vulnerable range.
+
+### Impact Analysis
+
+This is a low-risk dependency hardening change scoped to the docs site bundle. It
+does not alter application runtime behavior, but it removes a known vulnerable gem
+version from the repository-managed Ruby dependencies.
\ No newline at end of file
diff --git a/docs/explanation/fixes/EMBEDDING_RATE_LIMIT_WAIT_TIME_FIX.md b/docs/explanation/fixes/EMBEDDING_RATE_LIMIT_WAIT_TIME_FIX.md
new file mode 100644
index 00000000..40b5a9a7
--- /dev/null
+++ b/docs/explanation/fixes/EMBEDDING_RATE_LIMIT_WAIT_TIME_FIX.md
@@ -0,0 +1,44 @@
+# Embedding Rate Limit Wait Time Fix
+
+## Fix Title
+Embedding retries now honor server-provided wait times from Azure OpenAI rate-limit responses.
+
+## Issue Description
+The embedding helpers retried `429 Too Many Requests` failures using only local exponential backoff with jitter. When Azure OpenAI returned a `Retry-After` style header, the application ignored that server guidance and retried on its own schedule.
+
+## Root Cause Analysis
+- `generate_embedding()` and `generate_embeddings_batch()` only used a client-side backoff calculation after `RateLimitError`.
+- The underlying OpenAI/Azure OpenAI `429` response headers were available on the exception response, but the helper never parsed them.
+- As a result, retries could happen earlier than the service requested, increasing the chance of repeated throttling.
+
+## Version Implemented
+Fixed in version: **0.239.116**
+
+## Files Modified
+| File | Change |
+|------|--------|
+| `application/single_app/functions_content.py` | Added retry header parsing and applied it to both embedding retry loops |
+| `functional_tests/test_embedding_rate_limit_wait_time.py` | Added regression coverage for `Retry-After` parsing and embedding retry timing |
+| `application/single_app/config.py` | Version bump to 0.239.116 |
+
+## Code Changes Summary
+- Added a shared helper to parse `retry-after-ms`, `x-ms-retry-after-ms`, and `retry-after` values from rate-limit responses.
+- Updated both single-item and batched embedding generation to prefer the server-provided wait time when it is available and reasonable.
+- Kept the existing jittered exponential backoff as a fallback when the response does not provide a usable retry delay.
+
+## Testing Approach
+- Added `functional_tests/test_embedding_rate_limit_wait_time.py`.
+- The functional test stubs the embedding client and rate-limit exception so it can verify:
+ - Header parsing for millisecond and date-based retry values.
+ - Single embedding retries sleep for the server-provided duration.
+ - Batched embedding retries sleep for the server-provided duration.
+
+## Impact Analysis
+- Embedding retries now align more closely with Azure OpenAI throttling guidance.
+- This reduces avoidable repeat `429` responses during document ingestion and batched embedding creation.
+- Existing fallback behavior remains in place for responses that do not include a usable retry hint.
+
+## Validation
+- Regression test: `functional_tests/test_embedding_rate_limit_wait_time.py`
+- Before: embedding retries always used local backoff, even when the `429` response included a wait time.
+- After: embedding retries use the server-provided wait time when available, then fall back to local backoff only when necessary.
\ No newline at end of file
diff --git a/docs/explanation/fixes/GLOBAL_ACTION_AUDIT_USER_FALLBACK_FIX.md b/docs/explanation/fixes/GLOBAL_ACTION_AUDIT_USER_FALLBACK_FIX.md
new file mode 100644
index 00000000..011091f2
--- /dev/null
+++ b/docs/explanation/fixes/GLOBAL_ACTION_AUDIT_USER_FALLBACK_FIX.md
@@ -0,0 +1,45 @@
+# Global Action Audit User Fallback Fix
+
+## Fix Title
+Global Action Save Path Defaults Missing Audit User IDs
+
+## Issue Description
+`save_global_action()` accepted an optional `user_id`, but callers that omitted it could persist `created_by` and `modified_by` as `null`. This affected flows such as plugin validation repair, which saves plugin manifests through the global action helper without explicitly passing a user ID.
+
+## Root Cause Analysis
+- `save_global_action()` never mirrored the existing `save_global_agent()` behavior that resolves `user_id` through `get_current_user_id()` when the caller passes `None`.
+- The helper wrote audit fields directly from the unresolved `user_id`, so create operations stored `null` values.
+- Update operations preserved an existing `created_by` value even when it was already `null`, which meant previously corrupted audit data could survive indefinitely.
+
+## Version Implemented
+Fixed in version: **0.239.103**
+
+## Files Modified
+| File | Change |
+|------|--------|
+| `application/single_app/functions_global_actions.py` | Default missing `user_id` from `get_current_user_id()`, fall back to `system`, and repair null `created_by` on update |
+| `functional_tests/test_global_action_user_audit_fallback.py` | Added regression coverage for create and update audit-field fallback behavior |
+| `application/single_app/config.py` | Version bump to 0.239.103 |
+
+## Code Changes Summary
+- Imported `get_current_user_id()` into `functions_global_actions.py`.
+- When `user_id` is `None`, the helper now resolves the current authenticated user.
+- If no authenticated user is available, the helper falls back to `system` so audit fields remain non-null.
+- Existing actions with `created_by=None` are repaired on save by substituting the resolved fallback value.
+
+## Testing Approach
+- Added `functional_tests/test_global_action_user_audit_fallback.py`.
+- The test stubs the config, authentication, and Key Vault dependencies so it can exercise `save_global_action()` directly.
+- Coverage verifies both:
+ - Create flow uses `get_current_user_id()` when `user_id` is omitted.
+ - Update flow repairs a previously null `created_by` and falls back to `system` when no current user exists.
+
+## Impact Analysis
+- Global plugin/action saves now produce stable audit metadata even for internal or repair flows that do not pass a user ID explicitly.
+- Existing global actions with missing `created_by` values are corrected the next time they are saved.
+- No route or payload contract changes were introduced.
+
+## Validation
+- Regression test: `functional_tests/test_global_action_user_audit_fallback.py`
+- Before: `created_by` and `modified_by` could be stored as `null`.
+- After: both fields resolve to the current user ID or `system`.
\ No newline at end of file
diff --git a/docs/explanation/fixes/GROUP_PUBLIC_WORKSPACE_EXPANDED_TAGS_FIX.md b/docs/explanation/fixes/GROUP_PUBLIC_WORKSPACE_EXPANDED_TAGS_FIX.md
new file mode 100644
index 00000000..2f255b9a
--- /dev/null
+++ b/docs/explanation/fixes/GROUP_PUBLIC_WORKSPACE_EXPANDED_TAGS_FIX.md
@@ -0,0 +1,47 @@
+# Group/Public Workspace Expanded Tags Fix
+
+## Fix Title
+Group and public workspace expanded list rows now display document tags like the personal workspace.
+
+## Issue Description
+When a user expanded a document in list view inside a group workspace or public workspace, the metadata panel omitted the document's tags. The personal workspace already showed tags in the same expanded view, and the backend APIs for group and public workspaces were already returning each document's `tags` array.
+
+## Root Cause Analysis
+- The group workspace expanded-details renderer in `group_workspaces.html` never added a `Tags:` row.
+- The public workspace expanded-details renderer in `public_workspace.js` had the same omission.
+- Both workspaces already loaded workspace tag definitions and document tag arrays for filtering and metadata editing, so the gap was limited to list-view UI rendering rather than missing backend data.
+
+## Version Implemented
+Fixed in version: **0.239.113**
+
+## Files Modified
+| File | Change |
+|------|--------|
+| `application/single_app/templates/group_workspaces.html` | Added a local tag badge renderer and inserted a `Tags:` row into expanded list-view document details. |
+| `application/single_app/static/js/public/public_workspace.js` | Added a local tag badge renderer and inserted a `Tags:` row into expanded list-view document details. |
+| `functional_tests/test_group_public_workspace_expanded_tags.py` | Added regression coverage for the group/public expanded tag rows and helper usage. |
+| `application/single_app/config.py` | Version bump to `0.239.113`. |
+
+## Code Changes Summary
+- Added `renderGroupTagBadges()` in the group workspace page and `renderPublicTagBadges()` in the public workspace script.
+- Reused existing workspace tag definitions and color utilities so tags render with configured colors when available.
+- Added a neutral fallback badge color for unknown tag definitions and `No tags` text when a document has no tags.
+- Inserted the new `Tags:` row between `Keywords:` and `Abstract:` to match the personal workspace expanded-details layout.
+
+## Testing Approach
+- Added `functional_tests/test_group_public_workspace_expanded_tags.py`.
+- The test validates that:
+ - Personal workspace still provides the parity reference for expanded tag rendering.
+ - Group workspace defines a local badge helper and renders a `Tags:` row in expanded document details.
+ - Public workspace defines a local badge helper and renders a `Tags:` row in expanded document details.
+ - The `Tags:` row appears between `Keywords:` and `Abstract:` in both renderers.
+
+## Impact Analysis
+- Group workspace users now see document tags immediately when expanding a file in list view.
+- Public workspace users now get the same visibility without needing to open metadata editing flows.
+- The experience is now consistent across personal, group, and public workspaces while keeping backend contracts unchanged.
+
+## Validation
+- Before: group and public expanded list rows showed metadata such as version, authors, keywords, and abstract, but omitted tags.
+- After: both workspaces render color-coded tag badges or a `No tags` fallback in the expanded details row.
+- Regression test: `functional_tests/test_group_public_workspace_expanded_tags.py`
\ No newline at end of file
diff --git a/docs/explanation/fixes/OPENAPI_URL_IMPORT_REMOVAL_FIX.md b/docs/explanation/fixes/OPENAPI_URL_IMPORT_REMOVAL_FIX.md
new file mode 100644
index 00000000..f1b6f751
--- /dev/null
+++ b/docs/explanation/fixes/OPENAPI_URL_IMPORT_REMOVAL_FIX.md
@@ -0,0 +1,67 @@
+# OpenAPI URL Import Removal Fix
+
+Fixed/Implemented in version: **0.239.143**
+
+## Issue Description
+
+SimpleChat had moved the OpenAPI plugin UI to an upload-only workflow, but the backend still exposed authenticated URL import endpoints and legacy URL-based plugin creation paths.
+
+That mismatch left dead functionality in place and preserved an unnecessary server-side URL fetch surface that was no longer part of the supported product flow.
+
+## Root Cause Analysis
+
+The frontend plugin modal had already standardized on uploaded OpenAPI file content, but the backend still retained:
+
+- `/api/openapi/validate-url`
+- `/api/openapi/download-from-url`
+- URL-fetch validation helpers in `openapi_security.py`
+- a deprecated `openapi_source_type == 'url'` branch in the OpenAPI plugin factory
+
+Because those code paths still existed, authenticated callers could continue to exercise an unsupported backend URL import path even though the web UI no longer offered it.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/route_openapi.py`
+- `application/single_app/openapi_security.py`
+- `application/single_app/semantic_kernel_plugins/openapi_plugin_factory.py`
+- `application/single_app/config.py`
+- `docs/explanation/features/v0.229.001/OPENAPI_ACTION.md`
+- `functional_tests/test_openapi_upload_only_flow.py`
+
+### Code Changes Summary
+
+- Removed the backend URL import endpoints for validating and downloading OpenAPI specifications from remote URLs.
+- Removed URL-fetch validation helpers from the OpenAPI security validator so it now focuses on uploaded file content only.
+- Removed deprecated URL-based plugin factory handling to align runtime behavior with the upload/content-based configuration flow.
+- Updated the OpenAPI feature documentation to reflect the supported upload-only workflow.
+- Added regression coverage to ensure URL import routes and URL source handling do not return unintentionally.
+- Bumped the application version to `0.239.143`.
+
+### Testing Approach
+
+- Added `functional_tests/test_openapi_upload_only_flow.py` to verify the backend no longer exposes URL import routes or URL-based factory handling.
+- The regression test also checks that the frontend still requires uploaded OpenAPI content and that the config version matches the implementation.
+
+## Validation
+
+### Before
+
+- The modal required an uploaded OpenAPI file.
+- The backend still registered authenticated URL import endpoints.
+- The plugin factory still contained a deprecated URL source path.
+
+### After
+
+- OpenAPI configuration is consistently upload-only across the frontend and backend.
+- The unsupported server-side URL import surface has been removed.
+- The factory and documentation now match the supported content-based plugin configuration flow.
+
+### Impact Analysis
+
+This change is intentionally narrow:
+
+- the supported upload workflow remains unchanged
+- frontend configuration still stores validated OpenAPI spec content directly
+- only dead URL import behavior was removed
\ No newline at end of file
diff --git a/docs/explanation/fixes/PER_MESSAGE_WORD_EXPORT_ROUTE_FIX.md b/docs/explanation/fixes/PER_MESSAGE_WORD_EXPORT_ROUTE_FIX.md
new file mode 100644
index 00000000..7b3ab4dc
--- /dev/null
+++ b/docs/explanation/fixes/PER_MESSAGE_WORD_EXPORT_ROUTE_FIX.md
@@ -0,0 +1,49 @@
+# Per-Message Word Export Route Fix
+
+Fixed/Implemented in version: **0.239.128**
+
+## Issue Description
+
+The chat message dropdown still offered "Export to Word", but requests to `POST /api/message/export-word` returned `405 METHOD NOT ALLOWED`.
+
+## Root Cause Analysis
+
+The backend export module no longer registered the explicit `/api/message/export-word` route even though the frontend, release notes, feature documentation, and functional tests still referenced it.
+
+Because the explicit route was missing, Flask matched the request path against the generic `/api/message/` route instead. That route only supports `DELETE`, so a `POST` to `/api/message/export-word` failed with `405`.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/route_backend_conversation_export.py`
+- `application/single_app/config.py`
+- `functional_tests/test_per_message_export.py`
+
+### Code Changes Summary
+
+- Restored the explicit `POST /api/message/export-word` route in the conversation export module.
+- Added DOCX rendering helpers for single-message export, including basic markdown formatting and citation output.
+- Added regression coverage to verify that the backend source continues to define the explicit route.
+- Bumped the application version to `0.239.128`.
+
+### Testing Approach
+
+- Extended `functional_tests/test_per_message_export.py` with an AST-based regression check for the missing backend route.
+- Preserved the existing content normalization and Word document generation checks for the per-message export feature.
+
+## Validation
+
+### Before
+
+- `POST /api/message/export-word` returned `405 METHOD NOT ALLOWED`.
+- The frontend Word export action could not download a `.docx` file.
+
+### After
+
+- `POST /api/message/export-word` is explicitly registered again.
+- The frontend request can resolve to the intended Word export handler instead of the generic message delete route.
+
+### User Experience Improvement
+
+Users can export a single chat message to Word from the message dropdown without hitting a method error.
\ No newline at end of file
diff --git a/docs/explanation/fixes/PILLOW_PSD_UPLOAD_HARDENING_FIX.md b/docs/explanation/fixes/PILLOW_PSD_UPLOAD_HARDENING_FIX.md
new file mode 100644
index 00000000..7b9a3e8d
--- /dev/null
+++ b/docs/explanation/fixes/PILLOW_PSD_UPLOAD_HARDENING_FIX.md
@@ -0,0 +1,46 @@
+# Pillow PSD Upload Hardening Fix
+
+Fixed/Implemented in version: **0.239.134**
+
+## Issue Description
+
+The application pinned Pillow to `11.1.0`, which falls in the vulnerable range for an out-of-bounds write when parsing specially crafted PSD images.
+
+Although the admin settings page only intends to accept PNG and JPEG uploads for logos and favicons, those uploads were still passed directly to `Image.open(...)` without an explicit decoder allowlist.
+
+## Root Cause Analysis
+
+- `application/single_app/requirements.txt` pinned Pillow to a vulnerable version.
+- `application/single_app/route_frontend_admin_settings.py` relied on filename extensions before calling Pillow, but did not constrain Pillow to the actual image formats the route supports.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/requirements.txt`
+- `application/single_app/route_frontend_admin_settings.py`
+- `application/single_app/config.py`
+- `functional_tests/test_pillow_psd_upload_hardening.py`
+
+### Code Changes Summary
+
+- Updated the Pillow dependency pin to `12.1.1`.
+- Added `open_allowed_uploaded_image(...)` so admin logo and favicon uploads only open through Pillow with `PNG` and `JPEG` decoders enabled.
+- Reused that helper for standard logo, dark-mode logo, and favicon uploads.
+- Bumped the application version to `0.239.134`.
+
+### Testing Approach
+
+- Added `functional_tests/test_pillow_psd_upload_hardening.py` to verify the patched dependency pin, the route-level Pillow format allowlist, and the version bump.
+
+## Validation
+
+### Before
+
+- The app installed a Pillow version in the vulnerable range.
+- A disguised PSD upload could still be handed to Pillow from the admin image upload route.
+
+### After
+
+- The app pins Pillow to the patched version.
+- The admin image upload route now restricts Pillow parsing to the PNG and JPEG formats already allowed by the UI.
\ No newline at end of file
diff --git a/docs/explanation/fixes/REASONING_EFFORT_INITIAL_SYNC_FIX.md b/docs/explanation/fixes/REASONING_EFFORT_INITIAL_SYNC_FIX.md
new file mode 100644
index 00000000..b6efec4f
--- /dev/null
+++ b/docs/explanation/fixes/REASONING_EFFORT_INITIAL_SYNC_FIX.md
@@ -0,0 +1,45 @@
+# Reasoning Effort Initial Sync Fix
+
+## Fix Title
+Reasoning effort button now reflects the saved level for the selected model on the first chat-page load.
+
+## Issue Description
+The reasoning effort button could show the wrong initial icon and tooltip when a user first opened the chat page. If the user opened the reasoning modal and changed the level, the button updated immediately and stayed correct for the rest of that session.
+
+## Root Cause Analysis
+- `chat-onload.js` applied the preferred model only after a broader startup `Promise.all()` that also waited on document and prompt loading.
+- `chat-reasoning.js` fetched user settings independently and initialized the reasoning button as soon as its own request resolved.
+- When the reasoning settings request finished before the preferred model was applied, the button synced itself against the default model instead of the model the user had actually selected.
+- Because the preferred model was assigned programmatically without a later reasoning-state refresh, the stale icon and tooltip remained until the user changed the reasoning level manually.
+
+## Version Implemented
+Fixed in version: **0.239.125**
+
+## Files Modified
+| File | Change |
+|------|--------|
+| `application/single_app/static/js/chat/chat-onload.js` | Applied user settings earlier in startup and initialized reasoning after the preferred model is set |
+| `application/single_app/static/js/chat/chat-reasoning.js` | Added deterministic reasoning-state sync using already-loaded settings |
+| `functional_tests/test_reasoning_effort_initial_sync.py` | Added regression coverage for the startup ordering and reasoning sync path |
+| `functional_tests/test_chat_searchable_selectors.py` | Updated version metadata/assertion for the new release |
+| `functional_tests/test_workspace_scope_prompts_fix.py` | Updated version metadata/assertion for the new release |
+| `application/single_app/config.py` | Version bump to 0.239.125 |
+
+## Code Changes Summary
+- Added a shared reasoning-state sync path so the reasoning button can be refreshed explicitly for the current model.
+- Updated reasoning initialization to accept already-loaded user settings instead of always starting a second settings fetch race.
+- Changed chat startup so the preferred model is applied before the reasoning toggle is initialized.
+
+## Testing Approach
+- Added `functional_tests/test_reasoning_effort_initial_sync.py`.
+- Updated existing versioned functional tests so their release assertions match the new config version.
+
+## Impact Analysis
+- The reasoning button now shows the saved effort level and tooltip immediately for the active model on initial page load.
+- Startup remains responsive because user settings are still loaded in parallel with document and prompt data.
+- Real-time reasoning updates after a manual change continue to work through the existing model-change and save flow.
+
+## Validation
+- Regression test: `functional_tests/test_reasoning_effort_initial_sync.py`
+- Before: the first visible reasoning icon could reflect the wrong model context until the user manually changed reasoning.
+- After: reasoning state is synchronized after the preferred model is applied, so the initial button state matches the saved setting.
\ No newline at end of file
diff --git a/docs/explanation/fixes/REDUNDANT_CONVERSATION_ID_ASSIGNMENT_FIX.md b/docs/explanation/fixes/REDUNDANT_CONVERSATION_ID_ASSIGNMENT_FIX.md
new file mode 100644
index 00000000..ea588b24
--- /dev/null
+++ b/docs/explanation/fixes/REDUNDANT_CONVERSATION_ID_ASSIGNMENT_FIX.md
@@ -0,0 +1,36 @@
+# Redundant Conversation ID Assignment Fix
+
+Fixed/Implemented in version: **0.239.148**
+
+## Issue Description
+
+A standalone assignment in `route_backend_chats.py` reassigned `conversation_id` to itself during chat request processing.
+
+## Root Cause Analysis
+
+The statement `conversation_id = conversation_id` was left in a setup block where nearby lines initialize local state. The assignment had no runtime effect; it merely triggered a redundant-assignment warning and suggested a likely copy-paste mistake.
+
+## Technical Details
+
+- Files modified:
+ - `application/single_app/route_backend_chats.py`
+ - `application/single_app/config.py`
+ - `functional_tests/test_route_backend_chats_redundant_assignment.py`
+- Code changes summary:
+ - Removed the no-op `conversation_id = conversation_id` assignment from the chat handling path.
+ - Added a functional test that parses `route_backend_chats.py` with `ast` and fails if any standalone self-assignment remains.
+ - Bumped the application version to `0.239.148`.
+- Testing approach:
+ - Added a targeted regression test for standalone self-assignment detection.
+
+## Impact Analysis
+
+Removing the redundant assignment does not change runtime behavior because the previous statement had no effect. It does remove a misleading warning and reduces the chance of masking a real state-initialization bug later.
+
+## Validation
+
+- Before:
+ - `route_backend_chats.py` contained a standalone `conversation_id = conversation_id` statement.
+- After:
+ - The redundant assignment is removed.
+ - A regression test now checks the file for the same class of no-op assignment.
diff --git a/docs/explanation/fixes/SQL_PLUGIN_KEY_VAULT_SECRET_STORAGE_FIX.md b/docs/explanation/fixes/SQL_PLUGIN_KEY_VAULT_SECRET_STORAGE_FIX.md
new file mode 100644
index 00000000..dcee21df
--- /dev/null
+++ b/docs/explanation/fixes/SQL_PLUGIN_KEY_VAULT_SECRET_STORAGE_FIX.md
@@ -0,0 +1,55 @@
+# SQL Plugin Key Vault Secret Storage Fix
+
+## Fix Title
+SQL plugin credentials now use Azure Key Vault secret storage when it is enabled.
+
+## Issue Description
+SQL plugin configuration stored sensitive values such as `connection_string` and `password` directly in plugin manifests because those fields did not pass through the existing plugin Key Vault helper. The helper already supported `auth.key` and dynamic `__Secret` additional fields, but SQL credentials used regular field names, so Key Vault-enabled deployments still left SQL secrets in stored plugin data.
+
+## Root Cause Analysis
+- The shared plugin Key Vault helper only recognized `auth.key` and additional field names ending in `__Secret`.
+- SQL plugins used standard additional field names like `connection_string` and `password`, so those values bypassed Key Vault storage.
+- Edit flows returned `Stored_In_KeyVault` placeholders to the browser, but several save and delete paths did not reliably load the stored Key Vault reference names during updates and deletes.
+- The personal workspace bulk-save flow dropped plugin ids, which made rename and placeholder-preservation scenarios unreliable.
+
+## Version Implemented
+Fixed in version: **0.239.114**
+
+## Files Modified
+| File | Change |
+|------|--------|
+| `application/single_app/functions_keyvault.py` | Added SQL secret-field handling for plugin save/get/delete helpers and a shared plugin redaction helper |
+| `application/single_app/functions_personal_actions.py` | Preserved existing Key Vault references during personal action updates and deletes |
+| `application/single_app/functions_global_actions.py` | Preserved existing Key Vault references during global action updates and deletes |
+| `application/single_app/functions_group_actions.py` | Passed existing group action manifests into the Key Vault helper for placeholder-preserving updates |
+| `application/single_app/route_backend_plugins.py` | Preserved personal plugin ids during bulk saves, resolved stored SQL Key Vault secrets for edit-time connection tests, removed delete-then-save global edit behavior, and redacted plugin logs |
+| `application/single_app/static/js/plugin_modal_stepper.js` | Fixed SQL edit population, mapped SQL service-principal auth to the shared auth schema, and sent scope/id context for edit-time SQL connection tests |
+| `application/single_app/static/js/workspace/workspace_plugins.js` | Preserved plugin ids across personal workspace edits so stored Key Vault references survive rename/update flows |
+| `application/single_app/semantic_kernel_plugins/plugin_health_checker.py` | Validated SQL manifests using nested `additionalFields` values as well as top-level fields |
+| `functional_tests/test_sql_plugin_key_vault_secret_storage.py` | Added regression coverage for helper behavior and personal/global/group wrapper flows |
+| `application/single_app/config.py` | Version bump to 0.239.114 |
+
+## Code Changes Summary
+- SQL plugin `connection_string` and `password` additional fields are now treated as secret-bearing fields by the shared plugin Key Vault helper.
+- Existing stored Key Vault references are preserved during edit flows instead of being regenerated or dropped when the UI submits `Stored_In_KeyVault` placeholders.
+- Personal workspace plugin edits now preserve plugin ids so updates can target the existing stored document even when the plugin name changes.
+- The SQL connection test endpoint can now resolve previously stored Key Vault-backed SQL secrets during edit flows without forcing the user to re-enter them.
+- Plugin logging now redacts secret-bearing values before writing plugin manifests to logs.
+
+## Testing Approach
+- Added `functional_tests/test_sql_plugin_key_vault_secret_storage.py`.
+- The functional test stubs Key Vault, Cosmos, and action-helper dependencies so it can exercise:
+ - Shared SQL Key Vault secret save/get/delete behavior.
+ - Placeholder-preserving personal action save/delete flows.
+ - Placeholder-preserving global and group action save/delete flows.
+
+## Impact Analysis
+- New and updated SQL plugins now store secret-bearing configuration in Key Vault when `enable_key_vault_secret_storage` is enabled.
+- Existing plaintext SQL plugin records are not backfilled automatically; they remain unchanged until the plugin is saved again.
+- Edit flows for SQL plugins no longer require re-entering an unchanged stored connection string or password just to test or save the plugin.
+- Non-SQL plugin Key Vault behavior for `auth.key` and `additionalFields.*__Secret` remains intact.
+
+## Validation
+- Regression test: `functional_tests/test_sql_plugin_key_vault_secret_storage.py`
+- Before: SQL plugin `connection_string` and `password` values could remain in stored plugin data even when Key Vault was enabled.
+- After: those values are stored as Key Vault references, resolved at runtime and test time, preserved across edits, and cleaned up on delete.
\ No newline at end of file
diff --git a/docs/explanation/fixes/SQL_QUERY_PLUGIN_SCHEMA_AWARENESS_FIX.md b/docs/explanation/fixes/SQL_QUERY_PLUGIN_SCHEMA_AWARENESS_FIX.md
new file mode 100644
index 00000000..9713a62c
--- /dev/null
+++ b/docs/explanation/fixes/SQL_QUERY_PLUGIN_SCHEMA_AWARENESS_FIX.md
@@ -0,0 +1,160 @@
+# SQL Query Plugin Schema Awareness Fix
+
+## Fix Title
+SQL Query Plugin - Schema Awareness, Companion Plugin Auto-Creation, and Workflow Guidance
+
+## Issue Description
+When users asked database-related questions (e.g., "what is user1 licensed to use?"), agents connected to SQL databases would ask for clarification about table/column names instead of querying the database directly. The agent had no citations, meaning it never actually called any database tools.
+
+## Root Cause Analysis
+
+### Original Root Causes (v0.239.014)
+Three interconnected issues caused the initial failure:
+
+1. **Generic `@kernel_function` descriptions**: The SQL Query and SQL Schema plugin function descriptions were terse and generic (e.g., "Execute a SQL query and return results"). They provided no workflow guidance telling the LLM to discover the schema first before writing queries.
+
+2. **No schema context in agent instructions**: Agent instructions were passed through verbatim from configuration with no automatic injection of database schema information.
+
+3. **Independent, disconnected plugins**: The SQL Schema Plugin and SQL Query Plugin operated as completely independent plugins with no linkage.
+
+### Deeper Root Causes Discovered (v0.239.015)
+The v0.239.014 fix improved descriptions but actually made things **worse** because:
+
+4. **No companion schema plugin was ever loaded**: The ESAM agent only had ONE action configured (`sql_query` type). No `sql_schema` action existed in the agent's actions. The `_create_sql_plugin()` method creates exactly what the manifest requests — so only `SQLQueryPlugin` was loaded, never `SQLSchemaPlugin`.
+
+5. **Descriptions demanded non-existent functions**: The v0.239.014 descriptions said "you MUST first call get_database_schema or get_table_list from the SQL Schema plugin" — but those functions didn't exist in the kernel since no schema plugin was loaded. This created an **impossible dependency** that made the LLM ask for clarification instead.
+
+6. **Schema extraction found nothing**: `_extract_sql_schema_for_instructions()` only searched for `SQLSchemaPlugin` instances. Since none existed in the kernel, it returned an empty string, so no schema was injected into agent instructions.
+
+7. **SQLPluginFactory was disconnected**: The `SQLPluginFactory` class was designed to create `(SQLSchemaPlugin, SQLQueryPlugin)` pairs, but was never called by the `LoggedPluginLoader` pipeline.
+
+### Empty Schema Tables from INFORMATION_SCHEMA (v0.239.016)
+After the v0.239.015 fix, the agent could answer simple queries (e.g., "what is user1 licensed to use?" correctly returned Office 365 license data). However, complex multi-table JOIN queries (e.g., "which department is spending the most on licensing?") still failed because:
+
+8. **INFORMATION_SCHEMA views returned empty results on Azure SQL**: The `_get_tables_query()`, `_get_columns_query()`, and `_get_primary_keys_query()` methods used `INFORMATION_SCHEMA.TABLES`, `INFORMATION_SCHEMA.COLUMNS`, and `INFORMATION_SCHEMA.KEY_COLUMN_USAGE` respectively. These views returned **zero rows** in this Azure SQL environment, even though the database contained 5 user tables.
+
+9. **sys.\* catalog views worked correctly**: The `_get_relationships_data()` method used `sys.foreign_keys`, `sys.tables`, and `sys.columns` — and successfully returned 4 foreign key relationships. This proved the database connection and permissions were fine, but `INFORMATION_SCHEMA` access was restricted or misconfigured.
+
+10. **pyodbc.Row type mismatch**: The table iteration code used `isinstance(table, tuple)` to check row types, but `pyodbc.Row` objects may not pass this check depending on the pyodbc version. When `isinstance` returned `False`, the code fell into an `else` branch that assigned the entire Row object as the table name, causing subsequent SQL queries to fail silently in the exception handler.
+
+11. **Result**: `get_database_schema` returned `{'tables': {}, 'relationships': [4 items]}` — the agent had foreign key metadata but no table/column definitions, making it impossible to construct multi-table JOINs.
+
+## Version Implemented
+**Initial fix in version: 0.239.014**
+**Companion plugin fix in version: 0.239.015**
+**Schema catalog views fix in version: 0.239.016**
+
+## Files Modified
+
+| File | Change |
+|------|--------|
+| `application/single_app/semantic_kernel_plugins/sql_schema_plugin.py` | Rewrote all `@kernel_function` descriptions with prescriptive workflow guidance (v0.239.014); migrated all SQL Server queries from INFORMATION_SCHEMA to sys.\* catalog views and fixed pyodbc.Row handling (v0.239.016) |
+| `application/single_app/semantic_kernel_plugins/sql_query_plugin.py` | Rewrote all `@kernel_function` descriptions with resilient conditional guidance (v0.239.015); added `query_database` convenience function (v0.239.014); updated `metadata` property description |
+| `application/single_app/semantic_kernel_loader.py` | Added `_extract_sql_schema_for_instructions()` helper function; auto-injects database schema into agent instructions; added SQLQueryPlugin fallback detection (v0.239.015) |
+| `application/single_app/semantic_kernel_plugins/logged_plugin_loader.py` | Enabled SQL plugin creation path (v0.239.014); added `_auto_create_companion_schema_plugin()` method that auto-creates a SQLSchemaPlugin whenever a SQLQueryPlugin is loaded (v0.239.015) |
+| `application/single_app/config.py` | Version bump to 0.239.016 |
+
+## Code Changes Summary
+
+### v0.239.014 Changes
+
+#### 1. Prescriptive Function Descriptions (sql_schema_plugin.py)
+- `get_database_schema`: Now says "ALWAYS call this function FIRST before executing any SQL queries"
+- `get_table_list`: Now says "Use this function first to discover which tables are available"
+- `get_table_schema`: Now says "Call this after discovering tables via get_database_schema or get_table_list"
+- `get_relationships`: Now says "Use this to understand how tables connect via JOIN conditions"
+
+#### 2. New `query_database` Convenience Function (sql_query_plugin.py)
+- Accepts `question` (natural language) and `query` (SQL) parameters
+- Returns results with the original question context for better LLM response formatting
+
+#### 3. Auto Schema Injection (semantic_kernel_loader.py)
+- New `_extract_sql_schema_for_instructions()` function detects SQL Schema plugins in the kernel
+- Calls `get_database_schema()` at agent load time to fetch full schema
+- Formats schema as markdown tables (table names, columns, types, relationships)
+- Appends schema to agent instructions with directive: "Do NOT ask the user for table or column names"
+
+#### 4. Enabled SQL Plugin Creation Path (logged_plugin_loader.py)
+- Uncommented the `elif plugin_type in ['sql_schema', 'sql_query']` branch
+
+### v0.239.015 Changes (Complete Fix)
+
+#### 5. Auto-Create Companion Schema Plugin (logged_plugin_loader.py)
+- New `_auto_create_companion_schema_plugin()` method
+- When a `sql_query` plugin is loaded, automatically creates a companion `SQLSchemaPlugin` using the same connection details
+- Derives schema plugin name: `enterprise_software_asset_management_query` → `enterprise_software_asset_management_schema`
+- Checks if the companion already exists (idempotent)
+- Enables logging, wraps functions, registers with kernel
+- This is the **critical fix** — ensures schema discovery is always available even when only `sql_query` is configured
+
+#### 6. Resilient Function Descriptions (sql_query_plugin.py)
+- Changed from "you MUST first call get_database_schema" to "If the database schema is provided in your instructions, use those exact table and column names. If no schema is available, call get_database_schema"
+- This dual-path approach works whether schema is injected in instructions OR available via schema plugin functions
+
+#### 7. SQLQueryPlugin Fallback in Schema Extraction (semantic_kernel_loader.py)
+- Added fallback in `_extract_sql_schema_for_instructions()` that also detects `SQLQueryPlugin` instances
+- If no `SQLSchemaPlugin` is found, creates a temporary `SQLSchemaPlugin` from the query plugin's connection config
+- Belt-and-suspenders safety net in case companion auto-creation fails
+- Appends schema to agent instructions with directive: "Do NOT ask the user for table or column names"
+- This ensures the LLM ALWAYS has schema context even if it doesn't call the schema plugin
+
+#### Note: SQL Plugin Creation Path (logged_plugin_loader.py, carried over from v0.239.014 item 4)
+- The uncommented `elif plugin_type in ['sql_schema', 'sql_query']` branch remains in effect in this version
+- SQL plugins now use the explicit `_create_sql_plugin()` method instead of the generic discovery fallback
+
+### v0.239.016 Changes (Schema Catalog Views Fix)
+
+#### 8. Migrated SQL Server Queries to sys.\* Catalog Views (sql_schema_plugin.py)
+- **`_get_tables_query()`**: Replaced `INFORMATION_SCHEMA.TABLES` with `sys.tables t INNER JOIN sys.schemas s ON t.schema_id = s.schema_id WHERE t.type = 'U'`
+- **`_get_columns_query()`**: Replaced `INFORMATION_SCHEMA.COLUMNS` with `sys.columns c INNER JOIN sys.tables t ... LEFT JOIN sys.default_constraints dc ...` using `TYPE_NAME(c.user_type_id)` for data type resolution
+- **`_get_primary_keys_query()`**: Replaced `INFORMATION_SCHEMA.KEY_COLUMN_USAGE` with `sys.index_columns ic INNER JOIN sys.indexes i ... WHERE i.is_primary_key = 1`
+- This makes all SQL Server schema queries consistent with `_get_relationships_data()`, which already used sys.\* views successfully
+
+#### 9. Robust pyodbc.Row Handling (sql_schema_plugin.py)
+- **`get_database_schema()`**: Replaced `isinstance(table, tuple)` checks with try/except indexing; all row field values cast to `str()` before use as dict keys
+- **`get_table_list()`**: Same robust Row handling pattern applied to table row iteration
+- **`_get_table_schema_data()`**: Primary key list comprehension updated to use `str(pk[0])` without isinstance checks
+- This ensures the code works correctly regardless of whether `pyodbc.Row` inherits from `tuple` in the installed pyodbc version
+
+## Testing Approach
+- Functional test (v0.239.014): `functional_tests/test_sql_query_plugin_schema_awareness.py`
+- Functional test (v0.239.015): `functional_tests/test_sql_auto_schema_companion.py`
+- Validates `_auto_create_companion_schema_plugin` method exists with correct signature
+- Confirms companion creation is triggered in `load_plugin_from_manifest` for `sql_query` type
+- Verifies schema plugin name derivation logic (`_query` → `_schema` suffix swap)
+- Checks `@kernel_function` descriptions are resilient (no hard dependency on non-existent functions)
+- Validates `_extract_sql_schema_for_instructions` has SQLQueryPlugin fallback
+- Confirms version updated to 0.239.015
+- Functional test (v0.239.016): `functional_tests/test_sql_schema_sys_catalog_views.py`
+- Validates all SQL Server queries use `sys.tables`, `sys.columns`, `sys.indexes` instead of `INFORMATION_SCHEMA`
+- Confirms pyodbc.Row-safe iteration (no `isinstance(table, tuple)` checks)
+- Verifies primary key query uses `sys.index_columns` with `is_primary_key = 1`
+- Checks PostgreSQL/MySQL/SQLite queries remain unchanged
+- Confirms version updated to 0.239.016
+
+## Impact Analysis
+- **SQL-connected agents**: Will now automatically have BOTH a query plugin AND a companion schema plugin, even when only `sql_query` is configured. Schema is injected into agent instructions at load time.
+- **Non-SQL agents**: Completely unaffected (companion creation only triggers for `sql_query` type)
+- **LogAnalytics agents**: Unaffected (different plugin type)
+- **Performance**: One-time schema fetch at agent load time adds minimal latency; schema is cached in instructions for the session
+- **Backwards compatible**: If both `sql_query` and `sql_schema` actions are explicitly configured, the companion auto-creation is skipped (checks for existing plugin)
+
+## Before/After Comparison
+
+### With v0.239.014 (before the companion plugin fix)
+- User: "What is user1 licensed to use?"
+- Agent: "I need the exact user identifier... Please provide the identifier..." (no database call, no citations)
+- Root cause: Descriptions demanded calling schema functions that didn't exist in the kernel
+
+### After v0.239.015
+- User: "What is user1 licensed to use?"
+- Agent: Correctly returns Office 365 license data with LicenseID 1, TotalQuantity 52 (simple single-table queries work)
+- User: "Which department is spending the most on licensing?"
+- Agent: Fails — says "I don't see a department dimension in the current schema" and calls `get_database_schema` which returns `{'tables': {}}` (empty)
+- Root cause: INFORMATION_SCHEMA views returned no results on Azure SQL
+
+### After v0.239.016
+- User: "What is user1 licensed to use?"
+- Agent: Correctly returns license data (still works)
+- User: "Which department is spending the most on licensing?"
+- Agent: Has full schema with all 5 tables and their columns → can construct multi-table JOINs (Licenses → Procurements for cost, Usage for department) → returns department-level spending analysis
diff --git a/docs/explanation/fixes/STREAMING_ONLY_CHAT_PATH_FIX.md b/docs/explanation/fixes/STREAMING_ONLY_CHAT_PATH_FIX.md
new file mode 100644
index 00000000..9c6dc831
--- /dev/null
+++ b/docs/explanation/fixes/STREAMING_ONLY_CHAT_PATH_FIX.md
@@ -0,0 +1,74 @@
+# Streaming-Only Chat Path Fix
+
+Fixed in version: **0.239.127**
+
+## Issue Description
+
+The chat experience still maintained two first-party execution paths:
+
+- A streaming SSE path for normal chat responses.
+- A legacy non-streaming JSON path used as a direct fallback by the main send flow, retry flow, and edit flow.
+
+That duplication created drift between the two implementations. Features such as image generation, retry/edit behavior, and final message handling existed in the legacy path, while the product direction is to make streaming the only chat path used by the application.
+
+## Root Cause Analysis
+
+The frontend still posted directly to `/api/chat` from multiple modules, and the chat toolbar still presented streaming as optional. At the same time, the streaming finalizer did not fully support all terminal payload shapes already used by the legacy route, especially image results and reload-driven completion behavior.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/static/js/chat/chat-messages.js`
+- `application/single_app/static/js/chat/chat-streaming.js`
+- `application/single_app/static/js/chat/chat-edit.js`
+- `application/single_app/static/js/chat/chat-retry.js`
+- `application/single_app/static/js/chat/chat-input-actions.js`
+- `application/single_app/templates/chats.html`
+- `application/single_app/templates/profile.html`
+- `application/single_app/functions_settings.py`
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_streaming_only_chat_path.py`
+
+### Code Changes Summary
+
+- Removed the first-party chat UI fallback that posted directly to `/api/chat`.
+- Moved retry and edit flows onto the shared streaming helper.
+- Removed the chat toolbar streaming toggle so streaming is no longer presented as optional.
+- Extended the streaming finalizer to support image-generation results and reload-driven completion handling.
+- Added a streaming compatibility bridge in the backend for parity-sensitive requests, including image generation and retry/edit, while keeping `/api/chat` available as a temporary compatibility shim.
+- Updated defaults and profile messaging to reflect streaming-only chat behavior.
+- Added image-generation thought events on the streaming compatibility bridge so users see progress before the final image arrives.
+- Bumped the application version to `0.239.127`.
+
+## Testing Approach
+
+A functional regression test was added at `functional_tests/test_streaming_only_chat_path.py` to verify:
+
+- Main chat, retry, and edit entry points do not call `/api/chat` directly.
+- The streaming helper still uses `/api/chat/stream`.
+- The backend streaming route contains the compatibility bridge.
+- Image-generation compatibility requests emit useful streaming thoughts before the final image payload.
+- The chat template no longer includes the streaming toggle button.
+- The default setting and app version were updated.
+
+## Impact Analysis
+
+This change makes streaming the only first-party chat path exposed by the UI while preserving legacy behaviors through the streaming endpoint for image generation and retry/edit flows. It reduces the risk of feature drift between chat implementations and provides a safer base for removing the legacy shim entirely in a later cleanup pass.
+
+## Validation
+
+### Before
+
+- Main send flow could fall back to `/api/chat`.
+- Retry and edit always posted to `/api/chat`.
+- Image generation was blocked on the streaming route.
+- The toolbar exposed streaming as an optional toggle.
+
+### After
+
+- First-party chat flows send through `/api/chat/stream`.
+- Retry and edit are routed through the same streaming helper.
+- Image-generation requests are supported through the streaming route via the backend compatibility bridge.
+- Streaming is treated as required chat behavior in the UI.
diff --git a/docs/explanation/fixes/STREAMING_THOUGHT_FINALIZATION_FIX.md b/docs/explanation/fixes/STREAMING_THOUGHT_FINALIZATION_FIX.md
new file mode 100644
index 00000000..75d73c20
--- /dev/null
+++ b/docs/explanation/fixes/STREAMING_THOUGHT_FINALIZATION_FIX.md
@@ -0,0 +1,43 @@
+# Streaming Thought Finalization Fix
+
+## Fix Title
+Streaming chat responses now finalize reliably even when SSE events arrive across chunk boundaries or trailing thought events arrive after answer text has started streaming.
+
+## Issue Description
+In streaming mode, some responses would briefly show the full assistant answer and then revert to the final pulsing thought badge. The UI could remain stuck on that thought placeholder until the page was refreshed, even though the backend had already saved the assistant message.
+
+## Root Cause Analysis
+- The streaming client parsed each `reader.read()` chunk independently and split it by newline, which is not safe for SSE because a single event can arrive across multiple network chunks.
+- When the final `done` event was split across chunks, the client could miss it and never finalize the temporary streaming message.
+- The streaming thought renderer also replaced the entire temporary message body. If a late thought event arrived after answer text had already started rendering, it could overwrite the visible answer with the pulsing thought badge.
+
+## Version Implemented
+Fixed in version: **0.239.116**
+
+## Files Modified
+| File | Change |
+|------|--------|
+| `application/single_app/static/js/chat/chat-streaming.js` | Added buffered SSE frame parsing, explicit incomplete-stream handling, and content-start tracking for the temporary streaming message |
+| `application/single_app/static/js/chat/chat-thoughts.js` | Prevented streaming thoughts from replacing the temporary message once answer content has begun streaming |
+| `functional_tests/test_streaming_thought_finalization.py` | Added focused regression coverage for buffered SSE parsing and late-thought overwrite guards |
+| `application/single_app/config.py` | Version bump to 0.239.116 |
+
+## Code Changes Summary
+- Added a stateful SSE buffer so JSON payloads are parsed only after a full SSE event block is available.
+- Flushed the decoder and processed any trailing event data when the stream closes.
+- Added a fallback error path when a stream ends without completion metadata so the UI does not hang indefinitely on the temporary placeholder.
+- Marked the temporary streaming message once real answer content starts rendering.
+- Ignored subsequent streaming-thought placeholder renders after that point so answer text stays visible until finalization replaces the temporary message with the permanent assistant message.
+
+## Testing Approach
+- Added `functional_tests/test_streaming_thought_finalization.py`.
+- Re-ran the existing thoughts feature structural coverage to confirm the edited modules still expose the expected thought integration points.
+
+## Impact Analysis
+- Streaming responses should now remain stable on screen after answer text begins rendering.
+- Split SSE frames should no longer prevent the final `done` payload from being processed.
+- If the server ever closes a stream without completion metadata, the user now sees a partial-response warning instead of a permanent pulsing placeholder.
+
+## Validation
+- Before: final streamed answers could be replaced by the last thought badge and remain stuck until refresh.
+- After: answer text remains visible once content starts, and the temp streaming message either finalizes correctly or degrades into an explicit interrupted-stream state.
\ No newline at end of file
diff --git a/docs/explanation/fixes/TABULAR_COMPUTED_RESULTS_PROMPT_PRIORITY_FIX.md b/docs/explanation/fixes/TABULAR_COMPUTED_RESULTS_PROMPT_PRIORITY_FIX.md
new file mode 100644
index 00000000..11065e38
--- /dev/null
+++ b/docs/explanation/fixes/TABULAR_COMPUTED_RESULTS_PROMPT_PRIORITY_FIX.md
@@ -0,0 +1,42 @@
+# Tabular Computed Results Prompt Priority Fix
+
+## Fix Title
+Successful tabular tool analysis now has prompt priority over excerpt-only retrieval instructions in the final GPT response.
+
+## Issue Description
+The tabular SK pass could recover, find the correct worksheet, and compute the needed row-level values, but the outer GPT response could still answer as if those results were unavailable. In practice, the final response sometimes fell back to the search-excerpt framing and said it did not have direct access to the requested record even after the tool pass succeeded.
+
+## Root Cause Analysis
+- The retrieval augmentation prompt told the outer GPT response to answer only from retrieved excerpts and to say so when the answer was not present in those excerpts.
+- The tabular-computed-results handoff was added as a separate system message later in the prompt assembly.
+- That created a prompt conflict: search excerpts often contained only workbook schema context, while the successful tabular pass contained the actual computed row-level values.
+- The final model could anchor on the excerpt-only instruction and ignore the later tool-backed analysis, producing a cautious but incorrect fallback-style answer.
+
+## Version Implemented
+Fixed in version: **0.239.118**
+
+## Files Modified
+| File | Change |
+|------|--------|
+| `application/single_app/route_backend_chats.py` | Added shared prompt helpers so retrieval augmentation explicitly allows later tool-backed results and successful tabular analysis is marked authoritative |
+| `functional_tests/test_tabular_computed_results_prompt_priority.py` | Added regression coverage for the search-prompt contract and authoritative tabular handoff |
+| `application/single_app/config.py` | Version bump to 0.239.118 |
+
+## Code Changes Summary
+- Replaced the repeated search augmentation prompt text with a shared helper.
+- Updated the retrieval prompt so it permits and respects computed tool-backed results that appear in later system messages.
+- Replaced repeated successful-tabular-analysis handoff text with a shared helper that explicitly marks those results as authoritative for calculations and row-level facts.
+- Added a regression test to block reintroduction of the older excerpt-only wording.
+
+## Testing Approach
+- Added `functional_tests/test_tabular_computed_results_prompt_priority.py`.
+- Planned focused validation against the prompt helpers plus existing tabular orchestration coverage.
+
+## Impact Analysis
+- Successful tabular recovery should now survive the final answer synthesis step instead of being overwritten by schema-only search guidance.
+- The final GPT response should stop claiming it lacks direct access when tool-backed values are already present in the prompt.
+- This fix is general for workspace and chat-upload tabular analysis paths because it updates the shared prompt handoff contract rather than a workbook-specific rule.
+
+## Validation
+- Before: a recovered tabular pass could still lead to an excerpt-only final answer.
+- After: the final answer prompt treats successful tabular results as authoritative and no longer frames the answer as limited to excerpts alone.
\ No newline at end of file
diff --git a/docs/explanation/fixes/TABULAR_CROSS_SHEET_BRIDGE_ANALYSIS_FIX.md b/docs/explanation/fixes/TABULAR_CROSS_SHEET_BRIDGE_ANALYSIS_FIX.md
new file mode 100644
index 00000000..a5177f2a
--- /dev/null
+++ b/docs/explanation/fixes/TABULAR_CROSS_SHEET_BRIDGE_ANALYSIS_FIX.md
@@ -0,0 +1,61 @@
+# Tabular Cross-Sheet Bridge Analysis Fix
+
+Fixed in version: **0.239.140**
+
+## Issue Description
+
+Grouped workbook questions could fail when the answer required combining a small reference worksheet with a larger fact worksheet.
+
+Example pattern:
+- one worksheet lists canonical entities such as solution engineers
+- another worksheet contains the fact rows such as milestones
+- the user asks for grouped results per entity
+
+The prior orchestration sometimes stayed on a single worksheet, grouped a boolean or membership-style column, or fell back to schema-only language after an incomplete analytical pass.
+
+## Root Cause
+
+Analysis mode had strong single-sheet guidance but no generalized prompt for a reference-sheet plus fact-sheet bridge.
+
+Two specific gaps caused the failure:
+- multi-sheet analysis still established a default worksheet even when the workbook structure suggested the answer needed more than one sheet
+- the prompt did not tell the model to prefer canonical entity names from a small reference sheet over boolean or membership-flag columns in a larger fact sheet
+
+## Files Modified
+
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_tabular_cross_sheet_bridge_analysis.py`
+
+## Code Changes Summary
+
+- added generalized detection for grouped cross-sheet analytical questions
+- added a lightweight bridge-plan helper that infers a smaller reference worksheet and a larger fact worksheet from workbook metadata
+- prevented analysis mode from setting a default sheet when that bridge plan is active
+- added prompt guidance to query both sheets explicitly and avoid answering “each X” by grouping yes/no or membership-flag columns
+- added regression coverage for intent detection, bridge-plan inference, and prompt guardrails
+
+## Testing Approach
+
+The new functional regression test validates:
+- grouped cross-sheet questions remain in analysis mode rather than entity-lookup mode
+- workbook metadata produces the expected reference-sheet and fact-sheet bridge plan
+- the analysis prompt includes the new bridge-plan and flag-column guardrails
+
+## Impact Analysis
+
+This change is intentionally narrow:
+- schema-summary routing is unchanged
+- entity-lookup routing is unchanged
+- normal single-sheet analysis still keeps the existing default-sheet behavior
+
+The new behavior only activates when the question looks like a grouped analytical request and the workbook structure strongly suggests a small reference sheet plus a larger fact sheet.
+
+## Validation
+
+Expected improvement:
+- grouped cross-sheet workbook questions can iterate across the relevant tabs without overfitting to a workbook-specific scenario
+- the model is less likely to mistake flag columns for the requested entity dimension
+
+Related functional test:
+- `functional_tests/test_tabular_cross_sheet_bridge_analysis.py`
\ No newline at end of file
diff --git a/docs/explanation/fixes/TABULAR_ENTITY_LOOKUP_CROSS_SHEET_RETRY_FIX.md b/docs/explanation/fixes/TABULAR_ENTITY_LOOKUP_CROSS_SHEET_RETRY_FIX.md
new file mode 100644
index 00000000..23cc1b70
--- /dev/null
+++ b/docs/explanation/fixes/TABULAR_ENTITY_LOOKUP_CROSS_SHEET_RETRY_FIX.md
@@ -0,0 +1,47 @@
+# Tabular Entity Lookup Cross-Sheet Retry Fix
+
+## Fix Title
+Cross-sheet workbook entity lookups now retry when the analytical pass only succeeds on one worksheet and stops before collecting the related records requested by the user.
+
+## Issue Description
+Questions such as finding one taxpayer and showing their profile, return summary, W-2, 1099, payment, refund, notice, audit, and installment agreement records could appear to succeed while still returning an incomplete answer. The analytical tabular pass repeatedly queried `Taxpayers`, found the primary row, and then stopped without traversing the related worksheets.
+
+## Root Cause Analysis
+- The route layer treated any successful analytical invocation as sufficient to finish the inner tabular analysis pass.
+- For cross-sheet entity questions, that success condition was too weak because the first worksheet could succeed while leaving most requested record types untouched.
+- Multi-sheet default-sheet behavior also made the initial worksheet selection too sticky for entity/profile prompts that should span several tabs.
+- Worksheet matching did not preserve `W2` cleanly for sheet names such as `W2Forms`, which weakened related-sheet hinting for tax-document lookups.
+
+## Version Implemented
+Fixed in version: **0.239.119**
+
+## Files Modified
+| File | Change |
+|------|--------|
+| `application/single_app/route_backend_chats.py` | Added `entity_lookup` routing, related-worksheet hinting, execution-gap retries for incomplete one-sheet success, and stronger worksheet tokenization for `W2` sheet names |
+| `functional_tests/test_tabular_entity_lookup_mode.py` | Added regression coverage for entity-lookup routing, related-sheet ranking, and incomplete-success retry guardrails |
+| `functional_tests/test_tabular_workbook_schema_summary_mode.py` | Updated helper extraction to include the new execution-mode dependency and refreshed the file version header |
+| `application/single_app/config.py` | Version bump to 0.239.119 |
+
+## Code Changes Summary
+- Added a dedicated `entity_lookup` execution mode for cross-sheet profile and related-record questions.
+- Prevented multi-sheet entity lookups from relying on a sticky default worksheet during the analytical pass.
+- Added execution-gap retry feedback so the inner SK loop retries when successful tool calls only touched one worksheet or when the narrative still claims the data is unavailable.
+- Improved worksheet tokenization so `W2Forms` contributes a usable `w2` token during related-sheet scoring.
+
+## Testing Approach
+- Added `functional_tests/test_tabular_entity_lookup_mode.py`.
+- Re-ran focused tabular functional tests covering workbook schema-summary routing, retry-sheet recovery, and the new cross-sheet entity-lookup path.
+
+## Impact Analysis
+- Cross-sheet taxpayer and case-history questions should now keep traversing related worksheets instead of stopping after the first successful row.
+- Existing workbook summary and wrong-sheet recovery behavior remain intact because the new retry logic is scoped to `entity_lookup` mode.
+- Related-sheet hinting is stronger for IRS-style workbook tabs that encode tax forms directly in sheet names.
+
+## Validation
+- Before: a taxpayer lookup could query `Taxpayers` successfully several times, never inspect the other tabs, and still finish with a generic answer.
+- After: incomplete one-sheet success is treated as an execution gap, the analytical pass is retried with explicit cross-sheet guidance, and related tax-form worksheets such as `W2Forms` remain visible to the ranking logic.
+
+## Related Config Update
+- `application/single_app/config.py` now sets `VERSION = "0.239.119"`.
+- Related functional tests: `functional_tests/test_tabular_entity_lookup_mode.py` and `functional_tests/test_tabular_workbook_schema_summary_mode.py`.
\ No newline at end of file
diff --git a/docs/explanation/fixes/TABULAR_POPUP_DOWNLOAD_FIX.md b/docs/explanation/fixes/TABULAR_POPUP_DOWNLOAD_FIX.md
new file mode 100644
index 00000000..5dad38bd
--- /dev/null
+++ b/docs/explanation/fixes/TABULAR_POPUP_DOWNLOAD_FIX.md
@@ -0,0 +1,49 @@
+# Tabular Popup Download Fix
+
+Fixed/Implemented in version: **0.239.124**
+
+## Issue Description
+
+Downloading a workbook from the chat tabular preview popup could fail with a generic browser download error, while the app showed no JavaScript error and the backend logged no application error.
+
+## Root Cause Analysis
+
+The popup used a plain anchor-based download control for a session-protected endpoint.
+
+That meant the browser handled the request outside the app's normal error flow, so failures surfaced only as a generic download error and bypassed the UI's toast/error handling.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/static/js/chat/chat-enhanced-citations.js`
+- `application/single_app/config.py`
+- `functional_tests/test_tabular_popup_download_fix.py`
+
+### Code Changes Summary
+
+- Replaced the tabular popup download anchor with a controlled button-driven download flow.
+- Added an authenticated `fetch()` request for the tabular download endpoint using same-origin credentials.
+- Added blob-based client download handling and explicit toast/error reporting when the request fails.
+- Updated `config.py` to version `0.239.124` for this fix.
+
+### Testing Approach
+
+- Added a functional regression test that inspects the chat enhanced citations JavaScript for the fetch-to-blob download flow.
+- Added coverage to verify the popup no longer uses the old blank-target anchor download path.
+
+## Validation
+
+### Before
+
+- The tabular popup download used a browser-managed anchor request.
+- When the download failed, the user saw a generic browser download error without an app-level error message.
+
+### After
+
+- The tabular popup download is handled explicitly in JavaScript with `fetch()` and blob download logic.
+- Failures now route through the app's error handling and can surface a toast message instead of failing silently.
+
+### User Experience Improvement
+
+Users can download tabular files from the chat preview popup through a controlled download path that is more reliable and easier to troubleshoot when something goes wrong.
\ No newline at end of file
diff --git a/docs/explanation/fixes/TABULAR_RETRY_SHEET_RECOVERY_FIX.md b/docs/explanation/fixes/TABULAR_RETRY_SHEET_RECOVERY_FIX.md
new file mode 100644
index 00000000..3a2020d4
--- /dev/null
+++ b/docs/explanation/fixes/TABULAR_RETRY_SHEET_RECOVERY_FIX.md
@@ -0,0 +1,43 @@
+# Tabular Retry Sheet Recovery Fix
+
+## Fix Title
+Multi-sheet tabular analysis now recovers from a wrong initial worksheet guess by promoting candidate recovery sheets from failed analytical tool calls.
+
+## Issue Description
+Identifier-based workbook questions could fail even when the needed row existed in the workbook and document search had already surfaced the file. The analytical tabular pass sometimes started on a plausible but wrong worksheet, then kept retrying analytical tools against that same sheet until it exhausted retries and fell back to schema-only context.
+
+## Root Cause Analysis
+- The route layer used a lightweight likely-sheet heuristic to establish a default worksheet for multi-sheet analytical calls.
+- That heuristic did not reliably tokenize camel-case sheet names such as `TaxReturns`, which weakened the initial sheet guess for many workbook naming conventions.
+- When a tool call failed because the requested column was missing on the chosen sheet, the tool only returned a generic missing-column error. The retry loop had no structured signal telling it which other worksheet was a better candidate.
+- As a result, retries could keep hitting the same wrong worksheet even though the workbook schema already contained enough information to steer recovery.
+
+## Version Implemented
+Fixed in version: **0.239.117**
+
+## Files Modified
+| File | Change |
+|------|--------|
+| `application/single_app/semantic_kernel_plugins/tabular_processing_plugin.py` | Added workbook-aware missing-column payloads with `selected_sheet`, `missing_column`, and ordered `candidate_sheets` recovery hints |
+| `application/single_app/route_backend_chats.py` | Added retry-sheet override helpers, camel-case sheet tokenization, and retry-time default-sheet promotion based on failed tool payloads |
+| `functional_tests/test_tabular_retry_sheet_recovery.py` | Added regression coverage for camel-case sheet tokenization, candidate-sheet error payloads, and retry-sheet override selection |
+| `application/single_app/config.py` | Version bump to 0.239.117 |
+
+## Code Changes Summary
+- Improved worksheet tokenization so camel-case sheet names participate in likely-sheet matching more accurately.
+- Extended analytical tool errors so missing-column failures identify the current sheet and suggest candidate recovery sheets from the same workbook.
+- Added retry orchestration that reads those candidate sheets and updates the plugin's default worksheet before the next analytical attempt.
+- Updated the analytical system prompt so recovery-sheet hints override the original likely-sheet guess after a wrong-sheet failure.
+
+## Testing Approach
+- Added `functional_tests/test_tabular_retry_sheet_recovery.py`.
+- Re-ran focused multi-sheet and tool-error tabular functional tests to confirm retry recovery stays compatible with the existing analytical-only orchestration.
+
+## Impact Analysis
+- Identifier-based workbook questions should now recover when the first worksheet guess is wrong instead of repeating the same failing call.
+- This remains tool-driven behavior inside the analytical SK pass; it does not rely on schema-only fallback to answer workbook calculation questions.
+- The recovery behavior is generic across multi-sheet workbooks because it is based on missing-column signals and workbook sheet/column structure rather than workbook-specific rules.
+
+## Validation
+- Before: a wrong initial worksheet guess could lead to repeated analytical retries on the same sheet until the route fell back to schema context.
+- After: missing-column failures expose better candidate sheets and the next analytical retry can be redirected to the stronger worksheet automatically.
\ No newline at end of file
diff --git a/docs/explanation/fixes/TABULAR_WORKBOOK_SCHEMA_SUMMARY_ROUTING_FIX.md b/docs/explanation/fixes/TABULAR_WORKBOOK_SCHEMA_SUMMARY_ROUTING_FIX.md
new file mode 100644
index 00000000..1a6e40e8
--- /dev/null
+++ b/docs/explanation/fixes/TABULAR_WORKBOOK_SCHEMA_SUMMARY_ROUTING_FIX.md
@@ -0,0 +1,47 @@
+# Tabular Workbook Schema Summary Routing Fix
+
+## Fix Title
+Workbook-structure questions now use a schema-summary tabular mode instead of being forced through analytical-only tool retries.
+
+## Issue Description
+Selected or cited Excel workbooks were always routed into the analytical mini Semantic Kernel pass. That worked well for value lookups, aggregations, and grouped analysis, but workbook-summary prompts such as asking what worksheets exist, what each worksheet represents, and how they relate were not true analytical questions.
+
+## Root Cause Analysis
+- The tabular mini-agent was intentionally hardened to allow only analytical functions during its retry path.
+- Workbook-summary prompts still triggered that same analytical path, even though the correct tool for those questions is `describe_tabular_file()`.
+- As a result, the model sometimes chose analytical functions like `aggregate_column()` just to satisfy the forced tool-use requirement, which then failed on multi-sheet workbooks because no `sheet_name` was supplied.
+- When the mini-agent failed, the outer fallback prompt still told the final GPT pass to use plugin functions even though that stage could not actually invoke them.
+
+## Version Implemented
+Fixed in version: **0.239.115**
+
+## Files Modified
+| File | Change |
+|------|--------|
+| `application/single_app/route_backend_chats.py` | Added workbook-schema intent detection, schema-summary execution mode, and safer fallback prompt handling |
+| `functional_tests/test_tabular_workbook_schema_summary_mode.py` | Added regression coverage for workbook-summary intent routing, fallback prompts, and citation preservation |
+| `application/single_app/config.py` | Version bump to 0.239.115 |
+
+## Code Changes Summary
+- Added a narrow workbook-structure intent heuristic so prompts about worksheets, tabs, workbook summaries, and cross-sheet relationships route into a schema-summary tabular mode.
+- Extended the mini tabular SK executor with a `schema_summary` mode that allows `describe_tabular_file()` and treats it as a successful tool-backed result.
+- Kept the existing analytical-only path unchanged for value lookup, aggregation, filtering, and grouped-analysis questions.
+- Updated the workspace fallback prompt so the final GPT pass no longer gets impossible instructions to call plugin tools after the mini SK pass has already failed.
+- Preserved `describe_tabular_file()` citations when they are the only successful tabular tool calls.
+
+## Testing Approach
+- Added `functional_tests/test_tabular_workbook_schema_summary_mode.py`.
+- Re-ran the focused tabular regression suite to confirm the analytical path stayed intact:
+ - `functional_tests/test_tabular_analysis_rejects_discovery_only.py`
+ - `functional_tests/test_tabular_tool_error_retry_and_thoughts.py`
+ - `functional_tests/test_tabular_multisheet_workbook_support.py`
+ - `functional_tests/test_workspace_tabular_trigger_and_thoughts.py`
+
+## Impact Analysis
+- Workbook-summary questions should now reach the correct tabular tool path with fewer retries and lower latency.
+- Analytical questions keep the stricter analytical-only guardrails that were added to prevent discovery-only answers.
+- When the mini SK pass still fails, the outer fallback is now more honest about using schema-only context rather than implying that more tool calls will happen.
+
+## Validation
+- Before: workbook-summary questions could trigger repeated `aggregate_column()` failures on multi-sheet workbooks and then fall back through a contradictory prompt.
+- After: workbook-summary questions route to `describe_tabular_file()`-based schema summarization, while analytical questions remain on the analytical-only path.
\ No newline at end of file
diff --git a/docs/explanation/fixes/v0.239.008/CHAT_TABULAR_SK_TRIGGER_FIX.md b/docs/explanation/fixes/v0.239.008/CHAT_TABULAR_SK_TRIGGER_FIX.md
new file mode 100644
index 00000000..d525e2d1
--- /dev/null
+++ b/docs/explanation/fixes/v0.239.008/CHAT_TABULAR_SK_TRIGGER_FIX.md
@@ -0,0 +1,67 @@
+# Chat-Uploaded Tabular File SK Mini-Agent Trigger Fix
+
+## Issue Description
+
+When a user uploads a tabular file (CSV, XLSX, XLS, XLSM) directly to a chat conversation and asks a question in model-only mode (no agent selected), the SK mini-agent (`run_tabular_sk_analysis`) did not trigger. The model would see instructions to "use plugin functions" but could not call them without an agent, resulting in the model describing what it would do instead of providing actual analysis results.
+
+The full agent mode worked correctly because the agent has direct access to the `TabularProcessingPlugin` and can call its functions.
+
+## Root Cause
+
+Three gaps prevented the mini SK agent from activating for chat-uploaded tabular files:
+
+1. **Streaming path ignored `file` role messages**: The streaming conversation history loop (`/api/chat/stream`) only processed `user` and `assistant` roles, making chat-uploaded files completely invisible to the model.
+
+2. **Mini SK only triggered from search results**: Both streaming and non-streaming paths only invoked `run_tabular_sk_analysis()` when tabular files appeared in hybrid search results (`combined_documents`). Chat-uploaded files are stored in blob storage as `file` role messages and are not indexed in Azure AI Search, so they never appeared in search results.
+
+3. **Model-only mode can't call plugin functions**: The non-streaming path's file handler injected "Use the tabular_processing plugin functions" as a system message, but in model-only mode the model has no function-calling capability.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/route_backend_chats.py` — All code changes
+- `application/single_app/config.py` — Version bump to 0.239.008
+
+### Code Changes
+
+#### Non-streaming path (`/api/chat`)
+
+1. Added `chat_tabular_files = set()` tracker before the conversation history loop (~line 1896)
+2. Added `chat_tabular_files.add(filename)` inside the `if is_table and file_content_source == 'blob':` block (~line 1936)
+3. After the history loop, added a block that checks `chat_tabular_files` and calls `run_tabular_sk_analysis(source_hint="chat")`, injecting pre-computed results as a system message (~line 2027)
+
+#### Streaming path (`/api/chat/stream`)
+
+4. Replaced the simple 8-line history loop (which only handled `user`/`assistant`) with expanded logic that mirrors the non-streaming path's `file` role handling, including blob tabular file tracking (~line 3687)
+5. Added the same mini SK trigger block after the expanded loop (~line 3751)
+
+### How It Works After Fix
+
+1. User uploads `sales.xlsx` to chat, asks "analyze sales/profit"
+2. During conversation history building, the `file` role message with `is_table=True` and `file_content_source='blob'` is detected
+3. The filename is collected into `chat_tabular_files`
+4. After the history loop, `run_tabular_sk_analysis()` is called with `source_hint="chat"`, which resolves the file from the `personal-chat` blob container
+5. The mini SK agent pre-loads the file schema, calls plugin functions (aggregate, filter, etc.), and returns computed results
+6. Results are injected as a system message so the model can present accurate numbers
+7. Plugin invocation citations are collected for transparency
+
+## Testing
+
+1. Upload a tabular file (xlsx/csv) directly to chat
+2. With no agent selected, send a data analysis question
+3. Verify the response contains actual computed data (not just a description of steps)
+4. Check logs for `[Chat Tabular SK]` entries confirming the mini SK trigger
+5. Verify agent mode still works as before
+
+## Impact
+
+- Enables tabular data analysis in model-only chat mode for chat-uploaded files
+- No changes to existing search-result-based tabular detection
+- No changes to full agent mode behavior
+- Streaming and non-streaming paths both fixed
+
+## Version
+
+- **Version**: 0.239.008
+- **Implemented in**: 0.239.008
diff --git a/docs/explanation/fixes/v0.239.032/TABULAR_WORKSPACE_TRIGGER_AND_THOUGHTS_FIX.md b/docs/explanation/fixes/v0.239.032/TABULAR_WORKSPACE_TRIGGER_AND_THOUGHTS_FIX.md
new file mode 100644
index 00000000..0ab4e9c2
--- /dev/null
+++ b/docs/explanation/fixes/v0.239.032/TABULAR_WORKSPACE_TRIGGER_AND_THOUGHTS_FIX.md
@@ -0,0 +1,65 @@
+# Tabular Workspace Trigger and Thoughts Fix
+
+## Issue Description
+Users could ask multiple questions against the same selected tabular workspace file and see inconsistent behavior. A simple aggregation question could trigger the tabular SK mini-agent, while a later question against the same selected file could fall back to schema-only reasoning. In addition, the Processing Thoughts UI did not show any explicit `tabular_analysis` step even when tabular functions were used.
+
+**Version implemented:** 0.239.032
+
+Fixed/Implemented in version: **0.239.032**
+
+Related `config.py` update: `VERSION` was bumped to `0.239.032`.
+
+## Root Cause Analysis
+1. **Workspace trigger depended too heavily on search results**
+ - The tabular trigger only inspected `combined_documents` returned from hybrid search.
+ - If the selected tabular file produced sparse retrieval output or schema-only chunks, the trigger could miss the explicit workspace selection.
+2. **Mini-agent responses were not hardened against no-tool replies**
+ - For more complex analytical prompts, the mini-agent could return narrative text without actually calling the `TabularProcessingPlugin`.
+ - That produced no tool citations and left the final model with schema-only context.
+3. **Processing thoughts missed tabular work entirely**
+ - The chat flow recorded search, web, and generation steps, but never wrote a `tabular_analysis` thought for workspace or chat tabular runs.
+
+## Technical Details
+### Files Modified
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_workspace_tabular_trigger_and_thoughts.py`
+
+### Code Changes Summary
+- Added shared helpers to:
+ - detect supported tabular filenames consistently,
+ - resolve explicitly selected workspace tabular documents, and
+ - merge search-result files with selected-document files before triggering analysis.
+- Moved workspace tabular trigger logic so it can run from explicit workspace selection, not just retrieved chunks.
+- Hardened `run_tabular_sk_analysis()` with a retry path that requires actual tabular tool usage before accepting the result.
+- Added `tabular_analysis` thoughts for:
+ - workspace tabular analysis start/completion in non-streaming mode,
+ - workspace tabular analysis start/completion in streaming mode,
+ - chat-uploaded tabular analysis start/completion in non-streaming mode,
+ - chat-uploaded tabular analysis start/completion in streaming mode.
+
+### Testing Approach
+- Added `functional_tests/test_workspace_tabular_trigger_and_thoughts.py` to verify:
+ - explicit workspace-selected tabular files participate in trigger detection,
+ - tabular analysis thoughts are emitted in both chat paths,
+ - the mini-agent prompt now requires tool execution and retries when it answers without tools.
+
+## Impact Analysis
+- Explicitly selected CSV/Excel workspace files now have a more reliable analysis trigger path.
+- Complex tabular prompts are less likely to degrade into schema-only answers.
+- Users can now see tabular analysis activity directly in Processing Thoughts, improving transparency and debugging.
+
+## Validation
+### Before
+- Some workspace-selected tabular questions skipped the SK mini-agent even though the same file was still selected.
+- Thoughts could show search and generation steps without any indication that tabular analysis ran.
+
+### After
+- Workspace tabular analysis considers both retrieved tabular documents and explicitly selected tabular files.
+- Mini-agent retries are stricter when the first response skips tool execution.
+- Processing Thoughts now includes clear `tabular_analysis` steps whenever tabular analysis is attempted.
+
+## Related Validation Assets
+- Functional test: `functional_tests/test_workspace_tabular_trigger_and_thoughts.py`
+- Related feature documentation: `docs/explanation/features/v0.239.003/PROCESSING_THOUGHTS.md`
+- Related earlier fix: `docs/explanation/fixes/v0.239.008/CHAT_TABULAR_SK_TRIGGER_FIX.md`
diff --git a/docs/explanation/fixes/v0.239.033/TABULAR_DATETIME_COMPONENT_ANALYSIS_FIX.md b/docs/explanation/fixes/v0.239.033/TABULAR_DATETIME_COMPONENT_ANALYSIS_FIX.md
new file mode 100644
index 00000000..fafcb2f5
--- /dev/null
+++ b/docs/explanation/fixes/v0.239.033/TABULAR_DATETIME_COMPONENT_ANALYSIS_FIX.md
@@ -0,0 +1,64 @@
+# Tabular Datetime Component Analysis Fix
+
+## Issue Description
+Some tabular questions still fell back to schema-only context with the thought message `Tabular analysis could not compute results; using schema context instead`. This happened most often for time-based questions such as identifying peak hours, busiest weekdays, or monthly patterns from datetime columns.
+
+**Version implemented:** 0.239.033
+
+Fixed/Implemented in version: **0.239.033**
+
+Related `config.py` update: `VERSION` was bumped to `0.239.033`.
+
+## Root Cause Analysis
+1. **The plugin lacked datetime component grouping support**
+ - Existing tabular functions could aggregate and group by existing columns, but they could not directly derive `hour`, `day_of_week`, `month`, or similar components from datetime-like fields.
+ - Questions like “During what hours of the day do departure queues peak?” therefore required a transformation step the plugin did not expose.
+2. **The SK mini-agent could still fail even when the file triggered correctly**
+ - If the model could not find a tool sequence that matched the requested transformation, the tabular analysis flow returned `None` and the chat fell back to schema-only context.
+3. **There was no deterministic recovery path for common time-based questions**
+ - Even when datetime columns and queue/delay metrics were clearly present, the system did not attempt a direct computed fallback.
+
+## Technical Details
+### Files Modified
+- `application/single_app/semantic_kernel_plugins/tabular_processing_plugin.py`
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_tabular_datetime_component_analysis.py`
+
+### Code Changes Summary
+- Added datetime parsing helpers to `TabularProcessingPlugin` to support:
+ - ISO datetime strings,
+ - time-only strings,
+ - `HHMM` and `HHMMSS` compact time formats.
+- Added a new tabular plugin function: `group_by_datetime_component`
+ - Supports grouping by `year`, `month`, `month_name`, `day`, `date`, `hour`, `minute`, `day_name`, `weekday_number`, `quarter`, and `week`.
+ - Supports `count`, `sum`, `mean`, `min`, `max`, `median`, and `std` aggregations.
+ - Supports optional pre-group filtering with a pandas query expression.
+- Updated the tabular SK prompt and fallback guidance so time-based questions explicitly use `group_by_datetime_component`.
+- Added a direct datetime-aware fallback in `run_tabular_sk_analysis()` so common time-based questions can still return computed results even if the SK mini-agent does not successfully plan the tool sequence.
+
+### Testing Approach
+- Added `functional_tests/test_tabular_datetime_component_analysis.py` to verify:
+ - hour grouping works for ISO datetime strings,
+ - compact `HHMM` values are parsed correctly,
+ - route and plugin integration text references the new datetime grouping capability.
+
+## Impact Analysis
+- Time-based tabular questions now have a dedicated computation path instead of relying on schema-only reasoning.
+- Questions about peak hours, busiest weekdays, and similar datetime-derived trends are much less likely to fall back to the schema preview.
+- The direct fallback keeps user experience resilient even when the mini-agent does not autonomously choose the new function on the first try.
+
+## Validation
+### Before
+- Tabular analysis could trigger correctly but still fail to compute answers for questions requiring datetime-derived grouping.
+- Users saw the thought step `Tabular analysis could not compute results; using schema context instead` for time-based questions.
+
+### After
+- The tabular plugin can directly compute datetime component groupings.
+- The chat route can recover with a deterministic datetime-based fallback for common time-oriented questions.
+- Time-based questions now have a much stronger chance of returning computed results instead of schema-only context.
+
+## Related Validation Assets
+- Functional test: `functional_tests/test_tabular_datetime_component_analysis.py`
+- Related fix: `docs/explanation/fixes/v0.239.032/TABULAR_WORKSPACE_TRIGGER_AND_THOUGHTS_FIX.md`
+- Related thoughts documentation: `docs/explanation/features/v0.239.003/PROCESSING_THOUGHTS.md`
diff --git a/docs/explanation/fixes/v0.239.034/TABULAR_COMPUTED_ANALYSIS_ENFORCEMENT_FIX.md b/docs/explanation/fixes/v0.239.034/TABULAR_COMPUTED_ANALYSIS_ENFORCEMENT_FIX.md
new file mode 100644
index 00000000..58dc263a
--- /dev/null
+++ b/docs/explanation/fixes/v0.239.034/TABULAR_COMPUTED_ANALYSIS_ENFORCEMENT_FIX.md
@@ -0,0 +1,60 @@
+# Tabular Computed Analysis Enforcement Fix
+
+## Issue Description
+Some analytical tabular questions still completed after a schema-only discovery call such as `describe_tabular_file`. That let the model answer from preview rows instead of using computed query, filter, aggregate, or grouped results from the full dataset.
+
+**Version implemented:** 0.239.034
+
+Fixed/Implemented in version: **0.239.034**
+
+Related `config.py` update: `VERSION` was bumped to `0.239.034`.
+
+## Root Cause Analysis
+1. **Any plugin call counted as successful analysis**
+ - `run_tabular_sk_analysis()` accepted the first response as long as any plugin invocation occurred.
+ - Discovery calls such as `describe_tabular_file` therefore counted the same as real analytical operations.
+2. **Schema discovery citations overshadowed computed analysis intent**
+ - When discovery calls happened before analytical calls, citations could emphasize schema inspection rather than the computed operations that actually answered the question.
+3. **Prompt guidance did not explicitly reject discovery-only behavior**
+ - Even with pre-loaded schemas, the mini-agent could still call discovery helpers and stop there.
+
+## Technical Details
+### Files Modified
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_tabular_analysis_rejects_discovery_only.py`
+
+### Code Changes Summary
+- Added tabular invocation classification helpers in `route_backend_chats.py` to separate:
+ - discovery functions: `list_tabular_files`, `describe_tabular_file`
+ - analytical functions: `aggregate_column`, `filter_rows`, `query_tabular_data`, `group_by_aggregate`, `group_by_datetime_component`
+- Updated `run_tabular_sk_analysis()` so a response is accepted only when at least one analytical tabular function ran.
+- Added retry logging for discovery-only attempts so those paths are visible in diagnostics.
+- Updated tabular prompt guidance to explicitly reject discovery-only tool usage.
+- Filtered tabular citations so discovery-only calls are hidden when analytical tabular calls are present in the same analysis run.
+
+### Testing Approach
+- Added `functional_tests/test_tabular_analysis_rejects_discovery_only.py` to verify:
+ - discovery-only calls are explicitly rejected by prompt and retry guardrails,
+ - citation filtering prefers analytical calls,
+ - retry evaluation can isolate new invocations from the latest attempt.
+
+## Impact Analysis
+- Analytical tabular questions are less likely to be answered from schema previews or sample rows.
+- The mini-agent now has to perform a real computation before its output is trusted.
+- Citations better reflect the actual analytical operations used to answer the question.
+
+## Validation
+### Before
+- A single `describe_tabular_file` call could mark tabular analysis as complete.
+- Users could receive answers based on preview rows with thoughts showing tabular analysis as successful.
+
+### After
+- Discovery-only tool usage triggers a retry instead of being accepted as completed analysis.
+- Successful tabular analysis now requires a computed analytical call.
+- When analytical calls exist, tabular citations focus on those calls instead of schema-only discovery helpers.
+
+## Related Validation Assets
+- Functional test: `functional_tests/test_tabular_analysis_rejects_discovery_only.py`
+- Related fix: `docs/explanation/fixes/v0.239.033/TABULAR_DATETIME_COMPONENT_ANALYSIS_FIX.md`
+- Related fix: `docs/explanation/fixes/v0.239.032/TABULAR_WORKSPACE_TRIGGER_AND_THOUGHTS_FIX.md`
diff --git a/docs/explanation/fixes/v0.239.035/TABULAR_TOOL_CALL_THOUGHTS_FIX.md b/docs/explanation/fixes/v0.239.035/TABULAR_TOOL_CALL_THOUGHTS_FIX.md
new file mode 100644
index 00000000..15481301
--- /dev/null
+++ b/docs/explanation/fixes/v0.239.035/TABULAR_TOOL_CALL_THOUGHTS_FIX.md
@@ -0,0 +1,61 @@
+# Tabular Tool Call Thoughts Fix
+
+## Issue Description
+Tabular analysis thoughts were summarized as generic wrapper messages such as `Running tabular analysis on 1 workspace file(s)` and `Tabular analysis completed using 1 tool call(s)`. That hid which specific tabular tools actually ran, making it harder to understand whether the system queried, filtered, grouped, or only inspected the file.
+
+**Version implemented:** 0.239.035
+
+Fixed/Implemented in version: **0.239.035**
+
+Related `config.py` update: `VERSION` was bumped to `0.239.035`.
+
+## Root Cause Analysis
+1. **Thoughts were recorded at the workflow level instead of the tool level**
+ - The workspace and chat tabular paths emitted only start/completion wrapper thoughts.
+ - Individual plugin invocations were collected for citations but not surfaced as separate tabular thoughts.
+2. **Users could not see what analysis actually happened**
+ - A completion message with a tool count did not reveal whether the mini-agent used `query_tabular_data`, `group_by_datetime_component`, `aggregate_column`, or other functions.
+3. **The agent tool-call pattern already existed elsewhere**
+ - Agent execution paths already emitted one thought per plugin invocation, but the tabular pre-analysis flow had not adopted the same level of detail.
+
+## Technical Details
+### Files Modified
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_workspace_tabular_trigger_and_thoughts.py`
+
+### Code Changes Summary
+- Added helpers in `route_backend_chats.py` to:
+ - format concise tabular tool thought content,
+ - sanitize thought detail fields,
+ - convert tabular plugin invocations into per-tool thought payloads.
+- Replaced generic workspace and chat tabular wrapper thoughts with one `tabular_analysis` thought per tabular plugin invocation.
+- Preserved failure thoughts when tabular analysis cannot compute results.
+- Kept enhanced citations behavior unchanged while making the thoughts feed more transparent.
+
+### Testing Approach
+- Updated `functional_tests/test_workspace_tabular_trigger_and_thoughts.py` to verify:
+ - per-tool tabular thought helpers exist,
+ - workspace and streaming paths emit tool-level thought payload loops,
+ - generic completion wrapper thoughts are no longer used,
+ - formatted thought payloads contain useful parameters while excluding user and conversation identifiers.
+
+## Impact Analysis
+- Processing Thoughts now shows which tabular tool functions actually ran.
+- Users can distinguish schema inspection, filtering, grouping, and datetime analysis directly from the thoughts timeline.
+- Debugging tabular behavior is easier because the thought feed reflects the real analysis steps instead of only wrapper status messages.
+
+## Validation
+### Before
+- Thoughts showed only generic tabular wrapper messages.
+- Users could not tell which tabular function actually answered the question.
+
+### After
+- Thoughts include individual entries such as the exact tabular function invoked and its key parameters.
+- Generic wrapper completion thoughts are replaced by specific tabular tool-call thoughts.
+- Failure thoughts still appear when tabular analysis cannot compute results.
+
+## Related Validation Assets
+- Functional test: `functional_tests/test_workspace_tabular_trigger_and_thoughts.py`
+- Related fix: `docs/explanation/fixes/v0.239.034/TABULAR_COMPUTED_ANALYSIS_ENFORCEMENT_FIX.md`
+- Related fix: `docs/explanation/fixes/v0.239.033/TABULAR_DATETIME_COMPONENT_ANALYSIS_FIX.md`
diff --git a/docs/explanation/fixes/v0.239.036/TABULAR_GROUPED_PEAK_SUMMARY_FIX.md b/docs/explanation/fixes/v0.239.036/TABULAR_GROUPED_PEAK_SUMMARY_FIX.md
new file mode 100644
index 00000000..5dd1a069
--- /dev/null
+++ b/docs/explanation/fixes/v0.239.036/TABULAR_GROUPED_PEAK_SUMMARY_FIX.md
@@ -0,0 +1,74 @@
+# Tabular Grouped Peak Summary Fix
+
+## Issue Description
+Peak-style analytical questions such as `During what hours of the day do departure queues peak?` still depended too heavily on the model interpreting raw grouped output. The plugin could group by hour, but it did not return explicit highest and lowest group summary fields, and datetime parsing relied too much on generic inference for common US-style timestamps.
+
+**Version implemented:** 0.239.036
+
+Fixed/Implemented in version: **0.239.036**
+
+Related `config.py` update: `VERSION` was bumped to `0.239.036`.
+
+## Root Cause Analysis
+1. **Grouped outputs lacked explicit extremes**
+ - `group_by_datetime_component` and `group_by_aggregate` returned grouped data, but not direct highest and lowest group summaries.
+ - For peak-style questions, the model had to infer the answer from raw JSON instead of using clearly labeled summary fields.
+2. **Common US-style datetime strings were not parsed as explicitly as they should be**
+ - Real data such as the FAA sample file uses values like `5/14/2026 8:31:36 AM`.
+ - The plugin relied on generic fallback parsing too early, which is weaker and noisier than handling common formats directly.
+3. **The tabular prompt did not teach the model to use grouped summary fields**
+ - Even when grouped results were available, the prompt did not explicitly steer peak-style questions toward the strongest summary outputs.
+
+## Technical Details
+### Files Modified
+- `application/single_app/semantic_kernel_plugins/tabular_processing_plugin.py`
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_tabular_grouped_peak_summary.py`
+
+### Code Changes Summary
+- Improved `_parse_datetime_like_series()` to explicitly handle common date and datetime formats before generic fallback parsing.
+- Added `_build_grouped_summary()` so grouped outputs can expose:
+ - `highest_group`
+ - `highest_value`
+ - `lowest_group`
+ - `lowest_value`
+ - `average_group_value`
+ - `median_group_value`
+ - `second_highest_group`
+ - `second_highest_value`
+- Extended `group_by_aggregate()` to support:
+ - `median`
+ - `std`
+ - `top_n`
+ - `sort_descending`
+ - grouped summary fields and ranked `top_results`
+- Extended `group_by_datetime_component()` to return grouped summary fields alongside `top_results`.
+- Updated the tabular SK prompt so peak-style questions explicitly use the grouped summary fields.
+
+### Testing Approach
+- Added `functional_tests/test_tabular_grouped_peak_summary.py` to verify:
+ - artifact-style `M/D/YYYY h:mm:ss AM/PM` timestamps group correctly by hour,
+ - grouped datetime outputs return highest and lowest summaries,
+ - grouped aggregate outputs return generic peak summaries,
+ - the route prompt mentions the new grouped summary guidance.
+
+## Impact Analysis
+- Peak-style questions are easier for the model to answer correctly because the plugin now returns explicit extremes.
+- Common tabular files with US-style date/time strings are parsed more reliably.
+- The enhancements remain generic and reusable for any grouped categorical or time-based tabular analysis.
+
+## Validation
+### Before
+- The plugin could compute grouped values but forced the model to infer peaks from raw grouped JSON.
+- Timestamp parsing depended more than necessary on generic datetime inference.
+
+### After
+- Grouped tools return explicit highest and lowest summary fields for peak-style interpretation.
+- Artifact-style timestamps like those observed in the FAA CSV are parsed directly by known formats.
+- The route prompt now encourages the model to use the summary fields when answering peak and busiest questions.
+
+## Related Validation Assets
+- Functional test: `functional_tests/test_tabular_grouped_peak_summary.py`
+- Related fix: `docs/explanation/fixes/v0.239.035/TABULAR_TOOL_CALL_THOUGHTS_FIX.md`
+- Related fix: `docs/explanation/fixes/v0.239.033/TABULAR_DATETIME_COMPONENT_ANALYSIS_FIX.md`
diff --git a/docs/explanation/fixes/v0.239.037/TABULAR_TOOL_ERROR_RETRY_AND_THOUGHTS_FIX.md b/docs/explanation/fixes/v0.239.037/TABULAR_TOOL_ERROR_RETRY_AND_THOUGHTS_FIX.md
new file mode 100644
index 00000000..52625425
--- /dev/null
+++ b/docs/explanation/fixes/v0.239.037/TABULAR_TOOL_ERROR_RETRY_AND_THOUGHTS_FIX.md
@@ -0,0 +1,65 @@
+# Tabular Tool Error Retry and Thoughts Fix
+
+## Issue Description
+A failed analytical tabular tool call could still be treated as successful analysis when the plugin returned a JSON error payload rather than raising an exception. This let the chat stop after a single failed tabular tool attempt and produce a weak follow-up answer instead of retrying or falling back. It also left the visible thought feed too thin compared with the internal debug trail.
+
+**Version implemented:** 0.239.037
+
+Fixed/Implemented in version: **0.239.037**
+
+Related `config.py` update: `VERSION` was bumped to `0.239.037`.
+
+## Root Cause Analysis
+1. **Analytical tool presence was treated as success even when the result payload contained an error**
+ - `run_tabular_sk_analysis()` counted analytical function invocations without inspecting whether the returned JSON contained an `error` field.
+ - A single failed call such as `group_by_datetime_component` missing `aggregate_column` could therefore stop the retry flow early.
+2. **Retry attempts did not receive the previous tool error context**
+ - When the first tool call failed, the next SK attempt had no direct feedback about what argument was wrong.
+3. **Thoughts surfaced the tool call but not the recovery path**
+ - The UI could show a failed tool invocation, but not whether the system retried, recovered via fallback, or simply stopped.
+
+## Technical Details
+### Files Modified
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/config.py`
+- `functional_tests/test_tabular_tool_error_retry_and_thoughts.py`
+
+### Code Changes Summary
+- Added helpers to inspect tabular invocation result payloads and extract embedded JSON error messages.
+- Updated analytical invocation classification so tool calls returning JSON errors are treated as failed, not successful.
+- Updated citation filtering so failed analytical tabular calls do not appear as successful tool citations.
+- Fed previous tool error messages back into subsequent SK retry prompts.
+- Added tabular status thoughts for:
+ - recovery after retrying tool errors,
+ - recovery via internal fallback after tool errors,
+ - tool-error state before fallback when computation still fails.
+- Updated tabular tool-call thoughts so JSON error payloads render as failed tool thoughts in the UI.
+
+### Testing Approach
+- Added `functional_tests/test_tabular_tool_error_retry_and_thoughts.py` to verify:
+ - JSON error payloads are classified as failed analytical calls,
+ - failed analytical calls do not become citations,
+ - failed tool thoughts show error details,
+ - recovery thoughts are emitted for internal fallback,
+ - retry prompts include previous tool error feedback.
+
+## Impact Analysis
+- A single failed analytical tabular tool call no longer ends the analysis prematurely.
+- Retry attempts have better context to correct bad tool arguments.
+- The thought feed now better explains the difference between a failed tool call and a recovered final analysis.
+
+## Validation
+### Before
+- The system could stop after one failed analytical tool call.
+- A JSON error payload could still be treated like successful analysis.
+- Thoughts did not clearly show recovery after tool errors.
+
+### After
+- Failed analytical tool calls trigger retry or fallback instead of being accepted as success.
+- Previous tool errors are fed back into the retry prompt.
+- The UI can show failed tool calls and the recovery/fallback status more clearly.
+
+## Related Validation Assets
+- Functional test: `functional_tests/test_tabular_tool_error_retry_and_thoughts.py`
+- Related fix: `docs/explanation/fixes/v0.239.036/TABULAR_GROUPED_PEAK_SUMMARY_FIX.md`
+- Related fix: `docs/explanation/fixes/v0.239.035/TABULAR_TOOL_CALL_THOUGHTS_FIX.md`
diff --git a/docs/explanation/fixes/v0.239.038/TABULAR_YEAR_TREND_AND_SUMMARY_GUARDRAILS_FIX.md b/docs/explanation/fixes/v0.239.038/TABULAR_YEAR_TREND_AND_SUMMARY_GUARDRAILS_FIX.md
new file mode 100644
index 00000000..d92b0384
--- /dev/null
+++ b/docs/explanation/fixes/v0.239.038/TABULAR_YEAR_TREND_AND_SUMMARY_GUARDRAILS_FIX.md
@@ -0,0 +1,65 @@
+# Tabular Year Trend And Summary Guardrails Fix
+
+Fixed in version: **0.239.038**
+
+## Issue Description
+
+Broad tabular questions against workbook data could still produce weak behavior in two places:
+
+1. Year-based trend intent was under-inferred in the direct datetime fallback logic, even though the plugin already supports `year` grouping.
+2. Broad summaries could still mention speculative parser or follow-up analysis failures that were not the main requested outcome.
+
+This showed up with the Superstore workbook where yearly profit analysis should be computable from `Order Date`, but the response still mentioned a parameter parsing issue.
+
+## Root Cause Analysis
+
+- The route-level `infer_datetime_component()` helper recognized hour, weekday, month, quarter, week, and date intent, but not year, yearly, or annual intent.
+- The plugin-level `_try_numeric_conversion()` step converted already-parsed Excel datetime columns into numeric values before the datetime grouping logic ran, which broke valid workbook date columns like `Order Date`.
+- The mini-agent system prompt strongly required computed analysis, but it did not explicitly forbid narrating hypothetical or secondary failures when the user asked for a broad business summary.
+
+## Technical Details
+
+### Files Modified
+
+- `application/single_app/route_backend_chats.py`
+- `application/single_app/semantic_kernel_plugins/tabular_processing_plugin.py`
+- `application/single_app/config.py`
+- `functional_tests/test_tabular_datetime_component_analysis.py`
+
+### Code Changes Summary
+
+- Added `year`, `years`, `yearly`, `annual`, and `annually` keyword inference to the route datetime-component helper.
+- Preserved datetime and timedelta columns during the plugin numeric-conversion pass so Excel date columns remain usable for datetime grouping.
+- Expanded the tabular tool prompt text to frame `group_by_datetime_component` as the trend-analysis tool for year, quarter, month, week, day, and hour groupings.
+- Added a stronger prompt guardrail telling the tabular mini-agent not to mention hypothetical follow-up analyses, parser errors, or failed attempts unless the user explicitly asks about failures and real tool error output exists.
+- Extended the existing datetime regression test to cover yearly grouping behavior and the new prompt guidance.
+
+### Testing Approach
+
+- Reused the existing datetime component functional test file.
+- Added an in-memory yearly grouping scenario modeled on workbook-style date columns like `Order Date`.
+- Verified route prompt text contains the new yearly trend guidance and speculative-failure guardrail.
+
+## Impact Analysis
+
+- Improves reliability for yearly or annual time-series questions on CSV and Excel files.
+- Reduces the chance of broad tabular summaries surfacing distracting parser-error commentary when the user did not ask for failure details.
+- Keeps the tabular tool behavior generic rather than special-casing the Superstore workbook.
+
+## Validation
+
+### Test Results
+
+- `functional_tests/test_tabular_datetime_component_analysis.py`
+
+### Before
+
+- Year intent was not part of the direct datetime inference keywords.
+- Excel datetime columns could be converted away from datetime dtype before grouping.
+- The prompt left more room for broad summaries to mention hypothetical or secondary parsing failures.
+
+### After
+
+- Yearly and annual phrasing now map to `year` grouping intent.
+- Excel datetime columns remain intact for yearly and other datetime-component grouping.
+- The prompt explicitly steers the model toward computed findings only, without narrating unrelated failed attempts.
\ No newline at end of file
diff --git a/docs/explanation/fixes/v0.239.112/AGENT_AUDIT_METADATA_VALIDATION_FIX.md b/docs/explanation/fixes/v0.239.112/AGENT_AUDIT_METADATA_VALIDATION_FIX.md
new file mode 100644
index 00000000..0672c25d
--- /dev/null
+++ b/docs/explanation/fixes/v0.239.112/AGENT_AUDIT_METADATA_VALIDATION_FIX.md
@@ -0,0 +1,34 @@
+# Agent Audit Metadata Validation Fix (v0.239.112)
+
+## Issue Description
+Saving an existing agent could fail with `Agent validation failed: Additional properties are not allowed ('created_at', 'created_by', 'modified_at', 'modified_by' were unexpected)` when the browser sent back a round-tripped agent object that included server-managed audit metadata.
+
+## Root Cause Analysis
+The backend sanitized user-editable agent fields, but it did not strip server-managed audit or Cosmos metadata before schema validation. As a result, valid agent edits could be rejected purely because the payload still contained fields previously added by the backend.
+
+## Version Implemented
+Fixed/Implemented in version: **0.239.112**
+
+## Technical Details
+### Files Modified
+- application/single_app/functions_agent_payload.py
+- application/single_app/config.py
+- functional_tests/test_agent_audit_metadata_validation_fix.py
+
+### Code Changes Summary
+- Strip server-managed agent metadata such as `created_at`, `created_by`, `modified_at`, `modified_by`, `updated_at`, `last_updated`, `user_id`, `group_id`, and Cosmos system fields during payload sanitization.
+- Preserve existing save behavior where the backend rehydrates authoritative audit fields before persistence.
+- Add a regression test that validates a round-tripped agent payload can still pass schema validation.
+
+### Testing Approach
+- Run the functional test `functional_tests/test_agent_audit_metadata_validation_fix.py`.
+
+## Impact Analysis
+- Editing and saving existing agents no longer fails when the client includes backend-managed metadata.
+- The backend now treats audit metadata as authoritative server state rather than client-provided input.
+
+## Validation
+- Functional test: functional_tests/test_agent_audit_metadata_validation_fix.py
+
+## Reference to Config Version Update
+- Version updated in application/single_app/config.py to **0.239.112**.
\ No newline at end of file
diff --git a/docs/explanation/index.md b/docs/explanation/index.md
index cddbc6ba..8930a58c 100644
--- a/docs/explanation/index.md
+++ b/docs/explanation/index.md
@@ -3,6 +3,8 @@ Welcome to the **Explanation** section. Here you'll find understanding-oriented
- [Architecture](/explanation/architecture/)
- [Design Principles](/explanation/design_principles/)
- [Feature Guidance](/explanation/feature_guidance/)
+- [Running Simple Chat Locally](/explanation/running_simplechat_locally/)
+- [Running Simple Chat in Azure Production](/explanation/running_simplechat_azure_production/)
- **Scenarios:**
- [Agent Examples](/explanation/scenarios/agents/)
- [Workspace Examples](/explanation/scenarios/workspaces/)
diff --git a/docs/explanation/release_notes.md b/docs/explanation/release_notes.md
index 8077ff3f..09224196 100644
--- a/docs/explanation/release_notes.md
+++ b/docs/explanation/release_notes.md
@@ -2,7 +2,343 @@
# Feature Release
-### **(v0.239.001)**
+### **(v0.240.001)**
+
+#### Bug Fixes
+
+* **Pillow PSD Upload Hardening**
+ * Updated the application to use `pillow==12.1.1`, moving the app off the vulnerable Pillow range for specially crafted PSD image parsing.
+ * Hardened admin logo and favicon uploads so Pillow now only opens the PNG and JPEG formats already allowed by the route, preventing disguised PSD content from being decoded during upload processing.
+ * (Ref: `application/single_app/requirements.txt`, `application/single_app/route_frontend_admin_settings.py`, `functional_tests/test_pillow_psd_upload_hardening.py`)
+
+* **Changed-Files GitHub Action Supply Chain Remediation**
+ * Updated the release-notes pull request workflow to use the patched `tj-actions/changed-files@v46.0.1` release after the March 2025 supply chain compromise affecting older tag families.
+ * Added a functional regression check to ensure the workflow does not drift back to the known malicious commit or an older vulnerable action reference.
+ * (Ref: `release-notes-check.yml`, `test_changed_files_action_version.py`, GitHub Actions workflow security, CI dependency pinning)
+
+* **Personal Conversation Notification Scope Detection**
+ * Fixed a scope-detection bug where personal chat completions could save successfully without creating a completion notification or unread dot when unrelated active workspace state was still present in session.
+ * Personal completion-side effects are now determined from the saved conversation type instead of active workspace session values.
+ * (Ref: personal chat scope gating, `route_backend_chats.py`, `test_chat_completion_notifications.py`)
+
+* **Distributed Background Task Locks**
+ * Added Cosmos-backed distributed lock documents for approval expiry and retention policy background jobs so duplicate execution is reduced across multiple Gunicorn workers and App Service instances.
+ * Kept the current web-app-hosted scheduler model intact so teams can continue running these jobs from the existing App Service while improving cross-worker coordination.
+ * Updated the startup documentation and added functional validation for the distributed lock wiring.
+ * (Ref: `background_tasks.py`, `SIMPLECHAT_STARTUP.md`, `test_background_task_distributed_locks.py`, `test_startup_scheduler_support.py`)
+
+* **Background Task Default-On Gating**
+ * Updated the web runtime background task gate so scheduler loops now start by default even when `SIMPLECHAT_RUN_BACKGROUND_TASKS` is unset.
+ * Only explicit false-like values such as `0`, `false`, `no`, or `off` now disable the background loops, which matches the requested deployment behavior.
+ * Updated the startup guide and Gunicorn runtime validation test to reflect the new default-on behavior.
+ * (Ref: `app.py`, `SIMPLECHAT_STARTUP.md`, `test_gunicorn_startup_support.py`)
+
+* **Gunicorn Production Startup Support**
+ * Updated the app bootstrap so production deployments can run cleanly under Gunicorn instead of relying on Flask's built-in server, which is a poor fit for long-lived streaming chat requests on App Service.
+ * Added a shared Gunicorn config, switched the container entrypoint to Gunicorn, and made application initialization idempotent so startup logic can run safely in multi-worker web processes.
+ * Background timer and retention loops are now disabled by default under Gunicorn workers to avoid duplicating scheduler-style threads across workers, while local debug startup continues to use the Flask development server.
+ * (Ref: `app.py`, `gunicorn.conf.py`, `Dockerfile`, `test_gunicorn_startup_support.py`)
+
+* **Streaming-Only Chat Path**
+ * Updated the first-party chat experience so normal sends, retries, and message edits now use the streaming chat path instead of maintaining a separate non-streaming UI path.
+ * Preserved parity-sensitive behavior by extending the streaming flow to finalize image-generation responses correctly and by adding a backend compatibility bridge for retry, edit, and image-generation requests while the legacy `/api/chat` route remains in transition.
+ * Removed the chat-page streaming toggle, updated the UI to treat streaming as required behavior, and added regression coverage to prevent first-party chat modules from drifting back to direct `/api/chat` calls.
+ * (Ref: `route_backend_chats.py`, `chat-messages.js`, `chat-streaming.js`, `chat-retry.js`, `chat-edit.js`, `chats.html`, `test_streaming_only_chat_path.py`)
+
+* **Embedding Retry-After Wait Time Handling**
+ * Fixed embedding retries so `429 Too Many Requests` responses now honor server-provided wait times from `Retry-After` style headers instead of always using local backoff timing.
+ * This reduces avoidable repeat throttling during document processing, batched embedding generation, and search embedding requests when Azure OpenAI asks the client to wait.
+ * The existing exponential backoff behavior remains in place as a fallback when the service does not provide a usable retry delay.
+ * (Ref: `functions_content.py`, embedding retry logic, `test_embedding_rate_limit_wait_time.py`)
+
+* **SQL Plugin Key Vault Secret Storage**
+ * New and updated SQL Query and SQL Schema actions now store sensitive values such as connection strings and passwords in Azure Key Vault when Key Vault secret storage is enabled.
+ * Editing an existing SQL action now preserves stored Key Vault-backed credentials, including the SQL test connection flow, so users do not need to re-enter unchanged secrets just to validate or save the action.
+ * Personal, group, and global action flows now preserve existing secret references during updates, clean them up correctly on delete, and redact secret-bearing plugin values from logs.
+ * Existing plaintext SQL action credentials are not backfilled automatically; they move to Key Vault the next time the action is saved while Key Vault storage is enabled.
+ * (Ref: `functions_keyvault.py`, `route_backend_plugins.py`, `plugin_modal_stepper.js`, `workspace_plugins.js`, SQL action configuration)
+
+* **Group/Public Expanded Document Tags**
+ * Fixed group and public workspace list views so expanding a document now shows its tags, matching the personal workspace experience.
+ * The fix adds color-coded tag badges with a `No tags` fallback in expanded document details without changing the existing backend document APIs.
+ * (Ref: `group_workspaces.html`, `public_workspace.js`, expanded document details, workspace tag rendering)
+
+* **Agent Save Validation for Round-Tripped Metadata**
+ * Fixed agent saves failing when an existing personal, group, or global agent was edited and the browser sent back backend-managed audit fields such as `created_at`, `created_by`, `modified_at`, and `modified_by`.
+ * Agent payload sanitization now strips backend-managed audit and Cosmos metadata before schema validation, while preserving server-side tracking during persistence.
+ * (Ref: `functions_agent_payload.py`, `route_backend_agents.py`, agent schema validation, functional test coverage)
+
+* **Multi-Sheet Workbook Tabular Analysis**
+ * Fixed multi-sheet Excel workbooks being analyzed from the wrong worksheet during tabular chat responses. Questions that clearly target a specific tab, such as asset values in a workbook with `Assets`, `Balance`, and `Income` sheets, no longer silently default to the first sheet.
+ * Tabular runtime analysis now requires explicit `sheet_name` or `sheet_index` selection for analytical calls on multi-sheet workbooks, and the SK mini-agent preload now includes workbook sheet inventory and per-sheet schemas so the model can choose the correct worksheet before computing results.
+ * Enhanced citations and tabular previews now preserve worksheet context, using `Sheet: <sheet name>` for sheet-specific references and `Location: Workbook Schema` for workbook-level schema citations instead of generic `Page 1` labels. The tabular preview modal also supports switching between workbook sheets.
+ * (Ref: `tabular_processing_plugin.py`, `route_backend_chats.py`, `route_enhanced_citations.py`, `chat-enhanced-citations.js`, `chat-citations.js`, `chat-messages.js`)
+
+* **Tabular Citation Conversation Ownership Check**
+ * Fixed an IDOR vulnerability on `/api/enhanced_citations/tabular` where any authenticated user who could guess a `conversation_id` and `file_id` could download another user's chat-uploaded tabular files.
+ * The endpoint now reads the conversation document from Cosmos DB and verifies that `conversation.user_id` matches the current user before serving the blob. Returns 403 Forbidden on mismatch and 404 if the conversation does not exist.
+ * (Ref: `route_enhanced_citations.py`, `cosmos_conversations_container`)
+
+* **Tabular Preview `max_rows` Parameter Validation**
+ * The `max_rows` query parameter on `/api/enhanced_citations/tabular_preview` was parsed with bare `int()`, causing a 500 error on non-integer input. Switched to Flask's `request.args.get(..., type=int)` which silently falls back to the default on invalid input, matching the pattern used by other endpoints.
+ * (Ref: `route_enhanced_citations.py`)
+
+* **On-Demand Summary Generation — Content Normalization Fix**
+ * Fixed the `POST /api/conversations/<conversation_id>/summary` endpoint failing with an error when generating summaries from the conversation details modal.
+ * Root cause: message `content` in Cosmos DB can be a list of content parts (e.g., `[{type: "text", text: "..."}]`) rather than a plain string. The endpoint was passing the raw list as `content_text`, which either stringified incorrectly or produced empty transcript text.
+ * Now uses `_normalize_content()` to properly flatten list/dict content into plain text, matching the export pipeline's behavior.
+ * (Ref: `route_backend_conversations.py`, `_normalize_content`, `generate_conversation_summary`)
+
+* **Export Summary Reasoning-Model Compatibility**
+ * Fixed export intro summary generation failing or returning empty content with reasoning-series models (gpt-5, o1, o3) through a series of incremental fixes: using `developer` role instead of `system` for instruction messages, removing all `max_tokens` / `max_completion_tokens` caps so the model decides output length naturally, and adding null-safe content extraction for `None` responses.
+ * Summary now includes ALL messages (user, assistant, system, file, image analysis) for full context, with a simplified prompt producing 1-2 factual paragraphs.
+ * Added detailed debug logging showing message count, character count, model name, role, and finish reason.
+ * (Ref: `route_backend_conversation_export.py`, `_build_summary_intro`, `generate_conversation_summary`)
+
+* **Conversation Export Schema and Markdown Refresh**
+ * Fixed conversation exports lagging behind the live chat schema. JSON exports now include processing thoughts, normalized citations, and the raw document/web/tool citation buckets stored with assistant messages.
+ * Fixed Markdown exports being too flat and text-heavy by reorganizing them into a transcript-first layout with appendices for metadata, message details, references, thoughts, and supplemental records.
+ * Fixed exported conversations including content that no longer matched the visible chat by filtering deleted messages and inactive-thread retries, then reapplying thread-aware ordering before export.
+ * (Ref: `route_backend_conversation_export.py`, `test_conversation_export.py`, conversation export rendering)
+
+* **Export Tag/Classification Rendering Fix**
+ * Fixed conversation tags and classifications rendering as raw Python dicts (e.g., `{'category': 'model', 'value': 'gpt-5'}`) in both Markdown and PDF exports.
+ * Tags now display as readable `category: value` strings, with smart handling for participant names, document titles, and generic category/value pairs.
+ * (Ref: `route_backend_conversation_export.py`, `_format_tag` helper, Markdown/PDF metadata rendering)
+
+* **Export Summary Error Visibility**
+ * Added `debug_print` and `log_event` logging to all summary generation error paths, including the empty-response path that previously failed silently.
+ * The actual error detail is now shown in both Markdown and PDF exports when summary generation fails, replacing the generic "could not be generated" message.
+ * (Ref: `route_backend_conversation_export.py`, `_build_summary_intro`, export error rendering)
+
+* **Content Safety for Streaming Chat Path**
+ * Added full Azure AI Content Safety checking to the streaming (`/api/chat/stream`) SSE path, matching the existing non-streaming (`/api/chat`) implementation.
+ * Previously, only the non-streaming path performed content safety analysis; streaming conversations bypassed safety checks entirely.
+ * Implementation includes: `AnalyzeTextOptions` analysis, severity threshold checking (severity ≥ 4 blocks the message), blocklist matching, persistence of blocked messages to `cosmos_safety_container`, creation of safety-role message documents, and proper SSE event delivery of blocked status to the client.
+ * On block, the streaming generator yields the safety message and `[DONE]` event, then stops — preventing any further LLM invocation.
+ * Errors in the content safety call are caught and logged without breaking the chat flow, consistent with the non-streaming behavior.
+ * (Ref: `route_backend_chats.py`, streaming SSE generator, `AnalyzeTextOptions`, `cosmos_safety_container`)
+
+* **SQL Schema Plugin — Eliminate Redundant Schema Calls**
+ * Fixed agent calling `get_database_schema` twice per query even though the full schema was already injected into the agent's instructions at load time.
+ * Root cause: The `@kernel_function` descriptions in `sql_schema_plugin.py` said "ALWAYS call this function FIRST," which overrode the schema context already available in the instructions.
+ * Updated all four function descriptions (`get_database_schema`, `get_table_schema`, `get_table_list`, `get_relationships`) to use the resilient pattern: "If the database schema is already provided in your instructions, use that directly and do NOT call this function."
+ * This eliminates ~400ms+ of unnecessary database round trips per query and aligns with the same pattern already used in `sql_query_plugin.py`.
+ * (Ref: `sql_schema_plugin.py`, `@kernel_function` descriptions, schema injection)
+
+* **SQL Schema Plugin — Empty Tables from INFORMATION_SCHEMA**
+ * Fixed `get_database_schema` returning `'tables': {}` (empty) despite the database having tables, while relationships were returned correctly.
+ * Root cause: SQL Server table/column enumeration used `INFORMATION_SCHEMA.TABLES` and `INFORMATION_SCHEMA.COLUMNS` views, which returned empty results in the Azure SQL environment. Meanwhile, the relationships query used `sys.foreign_keys`/`sys.tables`/`sys.columns` catalog views which worked perfectly.
+ * Migrated all SQL Server schema queries to use `sys.*` catalog views consistently: `sys.tables`/`sys.schemas` for table enumeration, `sys.columns` with `TYPE_NAME()` for column details, and `sys.indexes`/`sys.index_columns` for primary key detection.
+ * Fixed `pyodbc.Row` handling throughout the plugin — removed all `isinstance(table, tuple)` checks that could fail with pyodbc Row objects, replaced with robust try/except indexing.
+ * This enables the full schema (tables, columns, types, PKs, FKs) to be injected into agent instructions, allowing agents to construct complex multi-table JOINs for analytical queries.
+ * (Ref: `sql_schema_plugin.py`, `sys.tables`, `sys.columns`, `sys.indexes`, pyodbc.Row handling)
+
+* **SQL Query Plugin — Auto-Create Companion Schema Plugin**
+ * Fixed the remaining issue where SQL-connected agents still asked for clarification instead of querying the database, even after description improvements.
+ * Root cause: Agents configured with only a `sql_query` action never had a `SQLSchemaPlugin` loaded in the kernel. The descriptions demanded calling `get_database_schema` — a function that didn't exist — creating an impossible dependency that caused the LLM to ask for clarification.
+ * `LoggedPluginLoader` now automatically creates a companion `SQLSchemaPlugin` whenever a `SQLQueryPlugin` is loaded, using the same connection details. This ensures schema discovery is always available.
+ * Updated `@kernel_function` descriptions to be resilient: "If the database schema is provided in your instructions, use it directly. Otherwise, call get_database_schema." This dual-path approach works whether schema is injected via instructions or available via plugin functions.
+ * Added fallback in `_extract_sql_schema_for_instructions()` to also detect `SQLQueryPlugin` instances and create a temporary schema extractor if no `SQLSchemaPlugin` is found.
+ * (Ref: `logged_plugin_loader.py`, `sql_query_plugin.py`, `semantic_kernel_loader.py`)
+
+* **SQL Query Plugin Schema Awareness**
+ * Fixed agents connected to SQL databases asking users for clarification about table/column names instead of querying the database directly.
+ * Root cause: SQL Query and SQL Schema plugin `@kernel_function` descriptions were generic with no workflow guidance, agent instructions had no database schema context, and the two plugins operated independently with no linkage.
+ * Rewrote all `@kernel_function` descriptions in both SQL plugins to be prescriptive workflow guides (modeled after the working LogAnalyticsPlugin), explicitly instructing the LLM to discover schema first before generating queries.
+ * Added auto-injection of database schema into agent instructions at load time — when SQL Schema plugins are detected, the full schema (tables, columns, types, relationships) is fetched and appended to the agent's system prompt.
+ * Added new `query_database(question, query)` convenience function to `SQLQueryPlugin` for intent-aligned tool calling.
+ * Enabled the SQL-specific plugin creation path in `logged_plugin_loader.py` (was previously commented out).
+ * (Ref: `sql_query_plugin.py`, `sql_schema_plugin.py`, `semantic_kernel_loader.py`, `logged_plugin_loader.py`)
+
+* **Chat-Uploaded Tabular Files Now Trigger SK Mini-Agent in Model-Only Mode**
+ * Fixed an issue where tabular files (CSV, XLSX, XLS, XLSM) uploaded directly to a chat conversation were not analyzed by the SK mini-agent when no agent was selected. The model would describe what analysis it would perform instead of returning actual computed results.
+ * **Root Cause**: The mini SK agent only triggered from search results, but chat-uploaded files are stored in blob storage and not indexed in Azure AI Search. Additionally, the streaming path completely ignored `file` role messages in conversation history.
+ * **Fix**: Both streaming and non-streaming chat paths now detect chat-uploaded tabular files during conversation history building and trigger `run_tabular_sk_analysis(source_hint="chat")` to pre-compute results. The streaming path also now properly handles `file` role messages (tabular and non-tabular) matching the non-streaming path's behavior.
+ * (Ref: `route_backend_chats.py`, `run_tabular_sk_analysis()`, `collect_tabular_sk_citations()`)
+
+* **Group SQL Action/Plugin Save Failure**
+ * Fixed group SQL actions (sql_query and sql_schema types) failing to save correctly due to missing endpoint placeholder. Group routes now apply the same `sql://sql_query` / `sql://sql_schema` endpoint logic as personal action routes.
+ * Fixed Step 4 (Advanced) dynamic fields overwriting Step 3 (Configuration) SQL values with empty strings during form data collection. SQL types now skip the dynamic field merge entirely since Step 3 already provides all necessary configuration.
+ * Fixed auth type definition schemas (`sql_query.definition.json`, `sql_schema.definition.json`) only allowing `connection_string` auth type, blocking `user`, `identity`, and `servicePrincipal` types that the UI and runtime support.
+ * Fixed `__Secret` key suffix mismatch in additional settings schemas where `connection_string__Secret` and `password__Secret` didn't match the runtime's expected `connection_string` and `password` field names. Also removed duplicate `azuresql` enum value.
+ * (Ref: `route_backend_plugins.py`, `plugin_modal_stepper.js`, `sql_query.definition.json`, `sql_schema.definition.json`, `sql_query_plugin.additional_settings.schema.json`, `sql_schema_plugin.additional_settings.schema.json`)
+
+#### New Features
+
+* **Conversation Completion Notifications**
+ * Added personal chat completion notifications so users who leave a conversation before the assistant finishes can still see that a response is ready.
+ * Notification clicks deep-link back into the completed conversation, and personal conversations now show a green unread dot until the assistant response is opened.
+ * The unread state and notification lifecycle are wired into the chat conversation list, sidebar list, and mark-read flow so the indicator clears once the conversation is actually viewed.
+ * (Ref: conversation notifications, unread assistant responses, `route_backend_chats.py`, `route_backend_conversations.py`, `functions_notifications.py`, `functions_conversation_unread.py`, `chat-conversations.js`, `chat-sidebar-conversations.js`)
+
+* **Background Chat Completion Away From Chat Page**
+ * Updated streaming chat execution so assistant responses can continue running after the user leaves the chat page instead of stopping when the browser disconnects from the stream.
+ * This keeps final assistant persistence, unread markers, and completion notifications reachable even when users navigate into Personal, Group, or other pages while a reply is still generating.
+ * (Ref: background stream execution, `BackgroundStreamBridge`, `route_backend_chats.py`, `test_chat_stream_background_execution.py`, `test_streaming_only_chat_path.py`)
+
+* **SimpleChat Startup and Scheduler Separation**
+ * Added deployment guidance for local development, Azure App Service native Python startup, and container runtimes so administrators can choose between direct Gunicorn startup and optional `python app.py` handoff behavior with clear environment-variable guidance.
+ * Extracted the scheduler-style logging timer, approval expiration, and retention loops into a shared background task module and added a dedicated `simplechat_scheduler.py` entrypoint so scheduled work can run in a separate process or job.
+ * This allows the web app to use Gunicorn with `workers=2` without duplicating scheduler loops inside every worker process, while keeping a legacy override available for single-process environments.
+ * (Ref: `app.py`, `background_tasks.py`, `simplechat_scheduler.py`, `SIMPLECHAT_STARTUP.md`, `test_startup_scheduler_support.py`)
+
+* **Chat Completion Notifications**
+ * Added personal chat completion notifications so users who leave a streaming conversation before the assistant finishes now receive a notification when the AI response is ready.
+ * Notification clicks deep-link directly back to the completed conversation, and personal conversations now show a green unread dot in both chat conversation lists until that response is opened.
+ * The unread state is cleared automatically when the conversation is opened or when the user stays on the chat page through stream completion, keeping the active-view experience clean without adding heartbeat tracking.
+ * (Ref: `route_backend_chats.py`, `route_backend_conversations.py`, `functions_notifications.py`, `functions_conversation_unread.py`, `chat-conversations.js`, `chat-sidebar-conversations.js`, `chat-streaming.js`, `test_chat_completion_notifications.py`)
+
+* **Configurable Tabular Preview Blob Size Limit**
+ * Added an admin-configurable maximum blob size for tabular file previews, replacing the previous hardcoded limit. Default is 200 MB.
+ * New **Tabular Preview Limits** card in the Enhanced Citations section of Admin Settings (Citations tab) lets admins increase or decrease the limit based on their compute resources and user population.
+ * Setting is stored as `tabular_preview_max_blob_size_mb` and accepts values from 1 to 1024 MB.
+ * (Ref: `route_enhanced_citations.py`, `functions_settings.py`, `admin_settings.html`)
+
+* **Tabular Preview Memory Optimization**
+ * The `/api/enhanced_citations/tabular_preview` endpoint no longer loads entire files into a DataFrame. It now uses `nrows` limits in `pandas.read_csv`/`read_excel` to read only the rows needed for the preview, and checks blob size before downloading to reject oversized files early.
+ * (Ref: `route_enhanced_citations.py`)
+
+* **Persistent Conversation Summaries**
+ * Summaries generated during conversation export are now saved to the conversation document in Cosmos DB for future reuse.
+ * Cached summaries include `message_time_start` and `message_time_end` — when a conversation has new messages beyond the cached range, a fresh summary is generated automatically.
+ * The conversation details modal now shows a **Summary** card at the top. If a summary exists it displays the content, generation date, and model used. If no summary exists a **Generate Summary** button with model selector lets users create one on demand.
+ * A **Regenerate** button is available on existing summaries to force a refresh with the currently selected model.
+ * New `POST /api/conversations/<conversation_id>/summary` endpoint accepts an optional `model_deployment` and returns the generated summary.
+ * The `GET /api/conversations/<conversation_id>/metadata` response now includes a `summary` field.
+ * Extracted `generate_conversation_summary()` as a shared helper used by both the export pipeline and the new API endpoint.
+ * (Ref: `route_backend_conversation_export.py`, `route_backend_conversations.py`, `chat-conversation-details.js`, `functions_conversation_metadata.py`)
+
+* **PDF Conversation Export**
+ * Added PDF as a third export format option alongside JSON and Markdown, giving users a print-ready, visually styled conversation archive.
+ * PDF output renders chat messages with colored bubbles that mirror the live chat UI: blue for user messages, gray for assistant messages, green for file messages, and amber for system messages.
+ * Message content is converted from Markdown to HTML for rich formatting (bold, italic, code blocks, lists, tables) inside the PDF.
+ * Full appendix structure is included (metadata, message details, references, processing thoughts, supplemental messages), matching the Markdown export layout.
+ * Rendering uses PyMuPDF's Story API on US Letter paper with 0.5-inch margins and automatic multi-page overflow.
+ * Works with both single-file and ZIP packaging; intro summaries are supported in PDF as well.
+ * Frontend format step updated to a 3-column card grid with a new PDF card using the `bi-filetype-pdf` icon.
+ * (Ref: `route_backend_conversation_export.py`, `chat-export.js`, PyMuPDF Story API, conversation export workflow)
+
+* **Conversation Export Intro Summaries**
+ * Added an optional AI-generated intro summary step to the conversation export workflow, so each exported chat can begin with a short abstract before the full transcript.
+ * Summary model selection now reuses the same model list shown in the chat composer, keeping the export flow aligned with the main chat experience.
+ * Works for both JSON and Markdown exports, including ZIP exports where each conversation keeps its own summary metadata.
+ * (Ref: `route_backend_conversation_export.py`, `chat-export.js`, conversation export workflow)
+
+* **Agent & Action User Tracking (created_by / modified_by)**
+ * All agent and action documents (personal, group, and global) now include `created_by`, `created_at`, `modified_by`, and `modified_at` fields that track which user created or last modified the entity.
+ * On updates, the original `created_by` and `created_at` values are preserved while `modified_by` and `modified_at` are refreshed with the current user and timestamp.
+ * New optional `user_id` parameter added to `save_group_agent`, `save_global_agent`, `save_group_action`, and `save_global_action` for caller-supplied user tracking (backward-compatible, defaults to `None`).
+ * (Ref: `functions_personal_agents.py`, `functions_group_agents.py`, `functions_global_agents.py`, `functions_personal_actions.py`, `functions_group_actions.py`, `functions_global_actions.py`)
+
+* **Activity Logging for Agent & Action CRUD Operations**
+ * Every create, update, and delete operation on agents and actions now generates an activity log record in the `activity_logs` Cosmos DB container and Application Insights.
+ * Six new logging functions: `log_agent_creation`, `log_agent_update`, `log_agent_deletion`, `log_action_creation`, `log_action_update`, `log_action_deletion`.
+ * Activity records include: `user_id`, `activity_type`, `entity_type` (agent/action), `operation` (create/update/delete), `workspace_type` (personal/group/global), and `workspace_context` (group_id when applicable).
+ * Logging is fire-and-forget — failures never break the CRUD operation.
+ * All personal, group, and admin routes for both agents and actions are wired up.
+ * (Ref: `functions_activity_logging.py`, `route_backend_agents.py`, `route_backend_plugins.py`)
+
+* **Tabular Data Analysis — SK Mini-Agent for Normal Chat**
+ * Tabular files (CSV, XLSX, XLS, XLSM) detected in search results now trigger a lightweight Semantic Kernel mini-agent that pre-computes data analysis before the main LLM response. This brings the same analytical depth previously only available in full agent mode to every normal chat conversation.
+ * **Automatic Detection**: When AI Search results include tabular files from any workspace (personal, group, or public) or chat-uploaded documents, the system automatically identifies them via the `TABULAR_EXTENSIONS` configuration and routes the query through the SK mini-agent pipeline.
+ * **Unified Workspace and Chat Handling**: Tabular files are processed identically regardless of their storage location. The plugin resolves blob paths across all four container types (`user-documents`, `group-documents`, `public-documents`, `personal-chat`) with automatic fallback resolution if the primary source lookup fails. A user asking about an Excel file in their personal workspace gets the same analytical treatment as one asking about a CSV uploaded directly to a chat.
+ * **Six Data Analysis Functions**: The `TabularProcessingPlugin` exposes `describe_tabular_file`, `aggregate_column` (sum, mean, count, min, max, median, std, nunique, value_counts), `filter_rows` (==, !=, >, <, >=, <=, contains, startswith, endswith), `query_tabular_data` (pandas query syntax), `group_by_aggregate`, and `list_tabular_files` — all registered as Semantic Kernel functions that the mini-agent orchestrates autonomously.
+ * **Pre-Computed Results Injected as Context**: The mini-agent's computed analysis (exact numerical results, aggregations, filtered data) is injected into the main LLM's system context so it can present accurate, citation-backed answers without hallucinating numbers.
+ * **Graceful Degradation**: If the mini-agent analysis fails for any reason, the system falls back to instructing the main LLM to use the tabular processing plugin functions directly, preserving full functionality.
+ * **Non-Streaming and Streaming Support**: Both chat modes are supported. The mini-agent runs synchronously before the main LLM call in both paths.
+ * **Requires Enhanced Citations**: The tabular processing plugin depends on the blob storage client initialized by the enhanced citations system. The `enable_enhanced_citations` admin setting must be enabled for tabular data analysis to activate.
+ * (Ref: `run_tabular_sk_analysis()`, `TabularProcessingPlugin`, `collect_tabular_sk_citations()`, `TABULAR_EXTENSIONS`)
+
+* **Tabular Tool Execution Citations**
+ * Every tool call made by the SK mini-agent during tabular analysis is captured and surfaced as an agent citation, providing full transparency into the data analysis pipeline.
+ * **Automatic Capture**: The existing `@plugin_function_logger` decorator on all `TabularProcessingPlugin` functions records each invocation including function name, input parameters, returned results, execution duration, and success/failure status.
+ * **Citation Format**: Tool execution citations appear in the same "Agent Tool Execution" modal used by full agent mode, showing `tool_name` (e.g., `TabularProcessingPlugin.aggregate_column`), `function_arguments` (the exact parameters passed), and `function_result` (the computed data returned).
+ * **End-to-End Auditability**: Users can verify exactly which aggregations, filters, or queries were run against their data, what parameters were used, and what raw results were returned — before the LLM summarized them into the final response.
+ * (Ref: `collect_tabular_sk_citations()`, `plugin_invocation_logger.py`)
+
+* **SK Mini-Agent Performance Optimization**
+ * Reduced typical tabular analysis time from ~74 seconds to an estimated ~30-33 seconds (55-60% reduction) through three complementary optimizations.
+ * **DataFrame Caching**: Per-request in-memory cache eliminates redundant blob downloads. Previously, each of the ~8 tool calls in a typical analysis downloaded and parsed the same file independently. Now the file is downloaded once and subsequent calls read from cache. Cache is automatically scoped to the request (new plugin instance per analysis) and garbage-collected afterward.
+ * **Pre-Dispatch Schema Injection**: File schemas (columns, data types, row counts, and a 3-row preview) are pre-loaded and injected into the SK mini-agent's system prompt before execution begins. This eliminates 2 LLM round-trips that were previously spent on file discovery (`list_tabular_files`) and schema inspection (`describe_tabular_file`), allowing the model to jump directly to analysis tool calls.
+ * **Async Plugin Functions**: All six `@kernel_function` methods converted to `async def` using `asyncio.to_thread()`. This enables Semantic Kernel's built-in `asyncio.gather()` to truly parallelize batched tool calls (e.g., 3 simultaneous `aggregate_column` calls) instead of executing them serially on the event loop.
+ * **Batching Instructions**: The system prompt now instructs the model to batch multiple independent function calls in a single response, reducing LLM round-trips further.
+ * (Ref: `_df_cache`, `asyncio.to_thread`, pre-dispatch schema injection in `run_tabular_sk_analysis()`)
+
+* **SQL Test Connection Button**
+ * Added a "Test Connection" button to the SQL Database Configuration section (Step 3) of the action wizard, allowing users to validate database connectivity before saving.
+ * Supports all database types: SQL Server, Azure SQL (with managed identity), PostgreSQL, MySQL, and SQLite.
+ * Shows inline success/failure alerts with a 15-second timeout cap and sanitized error messages.
+ * New backend endpoint: `POST /api/plugins/test-sql-connection`.
+ * (Ref: `route_backend_plugins.py`, `plugin_modal_stepper.js`, `_plugin_modal.html`)
+
+* **Per-Message Export**
+ * Added export and action options to the three-dots dropdown menu on individual chat messages (both AI and user messages).
+ * **Export to Markdown**: Downloads the message as a `.md` file with a role header. Entirely client-side.
+ * **Export to Word**: Generates a styled `.docx` document via a new backend endpoint (`POST /api/message/export-word`). Includes Markdown-to-Word formatting (headings, bold, italic, code blocks, lists) and a citations section when present.
+ * **Use as Prompt**: Inserts the raw message content directly into the chat input box for reuse — no clipboard, one click and it's ready to edit and send.
+ * **Open in Email**: Opens the user's default email client with the message pre-filled in the subject and body via `mailto:`.
+ * New options appear below a divider in the dropdown, preserving existing actions (Delete, Retry, Edit, Feedback).
+ * (Ref: `chat-message-export.js`, `chat-messages.js`, `route_backend_conversation_export.py`, per-message export)
+
+* **Custom Azure Environment Support in Bicep Deployment**
+ * Added `custom` as a supported `cloudEnvironment` value alongside `public` and `usgovernment`, enabling deployment to sovereign or custom Azure environments via Bicep.
+ * New Bicep parameters for custom environments: `customBlobStorageSuffix`, `customGraphUrl`, `customIdentityUrl`, `customResourceManagerUrl`, `customCognitiveServicesScope`, and `customSearchResourceUrl`. All of these are automatically populated from `az.environment()` defaults except `customGraphUrl`, which must be explicitly provided for custom cloud environments and can be overridden as needed.
+ * The `cloudEnvironment` parameter now defaults intelligently based on `az.environment().name`, and legacy values (`AzureCloud`, `AzureUSGovernment`) are mapped to SimpleChat's expected values (`public`, `usgovernment`).
+ * Custom environment app settings (`CUSTOM_GRAPH_URL_VALUE`, `CUSTOM_IDENTITY_URL_VALUE`, `CUSTOM_RESOURCE_MANAGER_URL_VALUE`, etc.) are conditionally injected only when `azurePlatform == 'custom'`.
+ * Replaced hardcoded ACR domain logic and auth issuer URLs with dynamic `az.environment()` lookups for better cross-cloud compatibility.
+ * Fixed trailing slash handling in `AUTHORITY` URL construction in `config.py` using `rstrip('/')`.
+ * (Ref: `deployers/bicep/main.bicep`, `deployers/bicep/modules/appService.bicep`, `config.py`, sovereign cloud support)
+
+* **Redis Key Vault Authentication**
+ * Added a new `key_vault` authentication type for Redis, allowing the Redis access key to be retrieved securely from Azure Key Vault at runtime rather than stored directly in settings.
+ * Applies across all Redis usage paths: app settings cache (`app_settings_cache.py`), session management (`app.py`), and the Redis test connection flow (`route_backend_settings.py`).
+ * Uses `retrieve_secret_direct()` from `functions_keyvault.py` to fetch the Redis key by its Key Vault secret name. Respects `key_vault_identity` for a user-assigned managed identity on the Key Vault client.
+ * New admin setting fields: `redis_auth_type` (values: `key`, `managed_identity`, `key_vault`) and `redis_key` (used as the Key Vault secret name when `key_vault` auth type is selected).
+ * **Files Modified**: `app_settings_cache.py`, `app.py` `configure_sessions`, `route_backend_settings.py` `_test_redis_connection`, `functions_keyvault.py` `retrieve_secret_direct`
+
+#### User Interface Enhancements
+
+* **Agent Responded Thought — Seconds & Total Duration**
+ * The "responded" thought now shows time in **seconds** instead of milliseconds, and clarifies it is the total time from the initial user message (e.g., `'gpt-5-nano' responded (16.3s from initial message)`).
+ * A `request_start_time` is now captured at the top of both the non-streaming and streaming chat handlers, so the duration reflects the full request lifecycle — including content safety, hybrid search, and agent invocation — not just the model response time.
+ * Applies to all three agent paths: local SK agents (non-streaming), Azure AI Foundry agents, and streaming SK agents.
+ * (Ref: `route_backend_chats.py`, `request_start_time`, agent responded thoughts)
+
+* **Enhanced Agent Execution Thoughts**
+ * Added detailed model-level status messages during agent execution, giving users full visibility into each stage of the AI pipeline.
+ * **Model Identification**: A new "Sending to '{deployment_name}'" thought appears immediately after "Sending to agent", showing the exact model deployment being used (e.g., `gpt-5-nano`).
+ * **Generating Response**: A "Generating response..." thought now appears before the agent begins its invocation loop, matching the existing behavior for non-agent GPT calls.
+ * **Model Responded with Duration**: A "'{deployment_name}' responded ({duration}ms)" thought appears after the agent completes, showing total wall-clock execution time.
+ * Applies to all three agent paths: local SK agents (streaming and non-streaming) and Azure AI Foundry agents.
+ * Uses the existing `generation` step type (lightning bolt icon) — no frontend changes required.
+ * (Ref: `route_backend_chats.py`, `ThoughtTracker`, agent execution pipeline)
+
+* **List/Grid View Toggle for Agents and Actions**
+ * Added a list/grid view toggle to all four workspace areas: personal agents, personal actions, group agents, and group actions.
+ * **Grid View**: Large cards with type icon, humanized name, truncated description, and action buttons (Chat, View, Edit, Delete as applicable).
+ * **List View**: Improved table layout with fixed column widths (28%/47%/25%), humanized display names, and truncated descriptions with hover tooltips for full text.
+ * **View Button**: New eye-icon button on every agent and action that opens a read-only detail modal with gradient-header summary cards (Basic Information, Model Configuration, Instructions for agents; Basic Information, Configuration for actions).
+ * **Name Humanization**: Display names are now automatically parsed — underscores and camelCase/PascalCase boundaries are converted to properly spaced, title-cased words (e.g., `myCustomAgent` → `My Custom Agent`).
+ * **Persistent Preference**: View mode selection (list/grid) is saved per area in localStorage and restored on page load.
+ * New shared utility module `view-utils.js` provides reusable functions for all four workspace areas.
+ * (Ref: `view-utils.js`, `workspace_agents.js`, `workspace_plugins.js`, `plugin_common.js`, `group_agents.js`, `group_plugins.js`, `workspace.html`, `group_workspaces.html`, `styles.css`)
+
+* **Chat with Agent Button for Group Agents**
+ * Added a "Chat" button to each group agent row, allowing users to quickly select a group agent and navigate to the chat page.
+ * (Ref: `group_agents.js`, `group_workspaces.html`)
+
+* **Hidden Deprecated Action Types**
+ * Deprecated action types (`sql_schema`, `ui_test`, `queue_storage`, `blob_storage`, `embedding_model`) are now hidden from the action creation wizard type selector. Existing actions of these types remain functional.
+ * (Ref: `plugin_modal_stepper.js`)
+
+* **Advanced Settings Collapse Toggle**
+ * Step 4 (Advanced) content is now hidden behind a collapsible toggle button ("Show Advanced Settings") instead of being displayed by default. Reduces visual noise for most users.
+ * For SQL action types, the redundant additional fields UI in Step 4 is hidden entirely since all SQL configuration is already handled in Step 3.
+ * Step 5 (Summary) no longer shows the raw additional fields JSON dump for SQL types, since that data is already shown in the SQL Database Configuration summary card.
+ * (Ref: `_plugin_modal.html`, `plugin_modal_stepper.js`)
+
+### **(v0.239.002)**
#### New Features
@@ -16,7 +352,7 @@
* **Retention Policy UI for Groups and Public Workspaces**
* Can now configure conversation and document retention periods directly from the workspace and group management page.
* Choose from preset retention periods ranging from 7 days to 10 years, use the organization default, or disable automatic deletion entirely.
-
+
* **Owner-Only Group Agent and Action Management**
* New admin setting to restrict group agent and group action management (create, edit, delete) to only the group Owner role.
* **Admin Toggle**: "Require Owner to Manage Group Agents and Actions" located in Admin Settings > My Groups section, under the existing group creation membership setting.
@@ -92,19 +428,6 @@
* **Files Modified**: `chat-documents.js`, `chat-messages.js`, `functions_search.py`, `route_backend_chats.py`, `chats.html`.
* (Ref: Multi-document selection, tag filtering, OData search integration, `CHAT_DOCUMENT_AND_TAG_FILTERING.md`)
-#### New Features
-
-* **Conversation Export**
- * Export one or multiple conversations from the Chat page in JSON or Markdown format.
- * **Single Export**: Use the ellipsis menu on any conversation to quickly export it.
- * **Multi-Export**: Enter selection mode, check the conversations you want, and click the export button.
- * A guided 4-step wizard walks you through selection review, format choice, packaging options (single file or ZIP archive), and download.
- * Sensitive internal metadata is automatically stripped from exported data for security.
-
-* **Retention Policy UI for Groups and Public Workspaces**
- * Can now configure conversation and document retention periods directly from the workspace and group management page.
- * Choose from preset retention periods ranging from 7 days to 10 years, use the organization default, or disable automatic deletion entirely.
-
#### Bug Fixes
* **Citation Parsing Bug Fix**
@@ -120,7 +443,7 @@
* Removed the membership verification from the `setActive` endpoint; the route still requires authentication (`@login_required`, `@user_required`) and the public workspaces feature flag (`@enabled_required`).
* Other admin-level endpoints (listing members, viewing stats, ownership transfer) retain their membership checks.
* (Ref: `route_backend_public_workspaces.py`, `api_set_active_public_workspace`)
-
+
* **Chats Page User Settings Hardening**
* Fixed a user-specific chats page failure where only one affected user could not load `/chats` due to malformed per-user settings data.
* **Root Cause**: The chats route assumed `user_settings["settings"]` was always a dictionary. If that field existed but had an invalid type (for example string, null, or list), the page could fail before rendering.
@@ -175,7 +498,6 @@
* **Solution**: Removed the post-save `global_selected_agent` enforcement from the add and edit routes. The delete route already correctly prevents deletion of the selected agent.
* (Ref: `route_backend_agents.py`, global agent add/edit routes, `global_selected_agent` setting)
-### **(v0.237.008)**
### **(v0.237.011)**
#### Bug Fixes
@@ -195,7 +517,7 @@
* **Removed Duplicate Comment**: Cleaned up duplicate "Render user-search results" comment.
* **Impact**: Member management buttons now render and function correctly, provide better error feedback, and auto-recover from stale member data.
* (Ref: `manage_group.js`, event handler deduplication, error handling improvements, toast notifications)
-
+
### **(v0.237.009)**
#### New Features
diff --git a/docs/explanation/running_simplechat_azure_production.md b/docs/explanation/running_simplechat_azure_production.md
new file mode 100644
index 00000000..7e6e3841
--- /dev/null
+++ b/docs/explanation/running_simplechat_azure_production.md
@@ -0,0 +1,105 @@
+# explanation/running_simplechat_azure_production.md
+---
+layout: libdoc/page
+title: Running Simple Chat in Azure Production
+order: 150
+category: Explanation
+---
+
+This guide explains the supported production startup patterns for Simple Chat in Azure.
+
+Current documentation version: 0.239.139
+
+## Default Azure Production Model in This Repo
+
+The repo-provided Azure deployment paths are container-based App Service deployments.
+
+That includes the deployers documented in this repository for:
+
+- `azd`
+- Bicep
+- Terraform
+- Azure CLI
+
+In those deployment models:
+
+- Azure App Service runs the published container image
+- the container entrypoint already starts Gunicorn
+- you do not need to set an App Service Stack Settings Startup command
+
+The web container entrypoint is:
+
+```text
+python3 -m gunicorn -c /app/gunicorn.conf.py app:app
+```
+
+## Native Python App Service Option
+
+If you intentionally deploy Simple Chat as a native Python App Service instead of using the repo container image, deploy the `application/single_app` folder and set the web startup command explicitly.
+
+Use this Startup command:
+
+```bash
+python -m gunicorn -c gunicorn.conf.py app:app
+```
+
+## Background Scheduler Guidance
+
+For production, keep scheduler-style work separate from multi-worker web processes when possible.
+
+Recommended web-process setting when scheduler work runs elsewhere:
+
+```bash
+SIMPLECHAT_RUN_BACKGROUND_TASKS=0
+```
+
+Recommended scheduler command:
+
+```bash
+python simplechat_scheduler.py
+```
+
+Operationally, that scheduler can run as:
+
+- a separate App Service or worker process
+- a scheduled container or job
+- another automation path that launches the same codebase with the scheduler command
+
+## Gunicorn Guidance for Azure
+
+Gunicorn is the production web server for Simple Chat in Azure-oriented deployments.
+
+The shared runtime config supports these tuning variables:
+
+- `GUNICORN_BIND`
+- `GUNICORN_WORKERS`
+- `GUNICORN_THREADS`
+- `GUNICORN_TIMEOUT`
+- `GUNICORN_GRACEFUL_TIMEOUT`
+- `GUNICORN_KEEPALIVE`
+- `GUNICORN_MAX_REQUESTS`
+- `GUNICORN_MAX_REQUESTS_JITTER`
+
+Use multiple workers only after you have decided how scheduler work is isolated.
+
+## Recommended Azure Production Pattern
+
+For most production environments in this repository:
+
+1. Deploy the container image through the repo-supported deployer.
+2. Let the container entrypoint launch Gunicorn.
+3. Do not configure an extra App Service Startup command.
+4. Move scheduler work into a separate runtime if you want clean multi-worker web behavior.
+
+## What Not to Do
+
+- Do not configure a second Gunicorn startup layer on top of the container deployer.
+- Do not treat Windows local development startup as proof of Gunicorn production behavior.
+- Do not leave scheduler decisions implicit if you plan to scale out workers or instances.
+
+## Summary
+
+- Repo deployers: container-based, Gunicorn already handled.
+- Native Python App Service: set the Gunicorn startup command explicitly.
+- Multi-worker production: separate scheduler work deliberately.
+- Local developer startup and Azure production startup should be treated as different runtime concerns.
\ No newline at end of file
diff --git a/docs/explanation/running_simplechat_locally.md b/docs/explanation/running_simplechat_locally.md
new file mode 100644
index 00000000..c1afdc51
--- /dev/null
+++ b/docs/explanation/running_simplechat_locally.md
@@ -0,0 +1,101 @@
+# explanation/running_simplechat_locally.md
+---
+layout: libdoc/page
+title: Running Simple Chat Locally
+order: 140
+category: Explanation
+---
+
+This guide explains the recommended local developer workflow for Simple Chat.
+
+Current documentation version: 0.239.136
+
+## Recommended Local Startup
+
+For normal development, start the app directly with Python:
+
+```bash
+python app.py
+```
+
+Set:
+
+```bash
+FLASK_DEBUG=1
+```
+
+This keeps Simple Chat on the Flask development server, enables local HTTPS behavior, and avoids unnecessary production-runtime complexity while you are editing and debugging the application.
+
+## Windows Developer Workflow
+
+Windows developers should use the direct Python startup path.
+
+Recommended local settings:
+
+```dotenv
+FLASK_DEBUG="1"
+SIMPLECHAT_USE_GUNICORN="1"
+SIMPLECHAT_RUN_BACKGROUND_TASKS="1"
+```
+
+Why this still works:
+
+- When `FLASK_DEBUG="1"`, `python app.py` stays on the Flask development server.
+- `SIMPLECHAT_USE_GUNICORN` is ignored while debug mode is enabled.
+- Background tasks continue to run in the single local process unless explicitly disabled.
+
+## Linux and macOS Developer Workflow
+
+Linux and macOS developers can use the same default local workflow:
+
+```bash
+FLASK_DEBUG=1 python app.py
+```
+
+That remains the recommended path for everyday development even on systems that can run Gunicorn.
+
+## When You Need Gunicorn-Specific Validation
+
+Use a Linux-compatible runtime only when you specifically need to validate:
+
+- multi-worker behavior
+- Gunicorn thread settings
+- keepalive and timeout behavior
+- production-like streaming behavior
+
+Example Gunicorn command:
+
+```bash
+gunicorn --bind=0.0.0.0:5000 --worker-class gthread --workers 2 --threads 8 --timeout 900 --graceful-timeout 60 --keep-alive 75 --max-requests 500 --max-requests-jitter 50 app:app
+```
+
+On Windows, use one of these options for that kind of validation:
+
+- Docker Desktop running the repo container image
+- WSL2 with a Linux shell
+- another Linux environment
+
+Native Windows Python should not be used to run Gunicorn directly.
+
+## Scheduler Behavior in Local Development
+
+By default, background loops remain enabled in local development.
+
+Use this variable only if you want to disable them in the current process:
+
+```bash
+SIMPLECHAT_RUN_BACKGROUND_TASKS=0
+```
+
+If you want to test the scheduler separately, run:
+
+```bash
+python simplechat_scheduler.py
+```
+
+## Practical Guidance
+
+- Use `python app.py` for normal development.
+- Keep `FLASK_DEBUG=1` on local developer machines.
+- Treat Gunicorn as a production-runtime validation tool, not the default local developer startup path.
+- On Windows, move to Docker or WSL2 when testing Gunicorn workers and threads matters.
\ No newline at end of file
diff --git a/docs/how-to/docker_customization.md b/docs/how-to/docker_customization.md
new file mode 100644
index 00000000..23812966
--- /dev/null
+++ b/docs/how-to/docker_customization.md
@@ -0,0 +1,8 @@
+# Docker Customization
+
+## Custom Certificate Authorities
+
+Add custom certificate authorities to the `docker-customization/custom-ca-certificates/` directory in the repository root, and they will be pulled into the system CAs during docker build. Must be in `.crt` format.
+
+## Custom pip conf
+Add customization as needed to the `docker-customization/pip.conf` file in the repository root. This will be used during docker build.
\ No newline at end of file
diff --git a/docs/how-to/upgrade_paths.md b/docs/how-to/upgrade_paths.md
new file mode 100644
index 00000000..50d70c00
--- /dev/null
+++ b/docs/how-to/upgrade_paths.md
@@ -0,0 +1,144 @@
+# Upgrade Paths
+
+Use this guide when you already have SimpleChat deployed and want to update the application without rediscovering the initial deployment steps.
+
+## Choose the Right Upgrade Path
+
+| If you deployed SimpleChat as... | Use this path | Default upgrade command or method |
+| :--- | :--- | :--- |
+| **Native Python Azure App Service** | [Native Python App Service Upgrades](#native-python-app-service-upgrades) | VS Code deployment or Azure CLI ZIP deploy |
+| **Container-based Azure App Service** using the repo `azd`, Bicep, Terraform, or Azure CLI deployers | [Container-Based App Service Upgrades](#container-based-app-service-upgrades) | `azd deploy` for code-only updates |
+
+## Native Python App Service Upgrades
+
+This path applies when you deployed the application code directly to Azure App Service instead of using the repo's container image.
+
+### Required Startup Command Check
+
+For native Python App Service upgrades, do **not** leave the App Service Stack Settings Startup command blank.
+
+Deploy and run the `application/single_app` folder in App Service.
+
+Use this Startup command:
+
+```bash
+python -m gunicorn -c gunicorn.conf.py app:app
+```
+
+Validate this before or during the upgrade. A missing or incorrect Startup command is one of the fastest ways to turn a straightforward code update into an outage.
+
+### Recommended Native Upgrade Methods
+
+#### Option 1: Visual Studio Code Deployment
+
+Use this when you want the simplest manual update path.
+
+1. Sign in to Azure from VS Code.
+2. Open the Azure extension.
+3. Find the existing App Service.
+4. Right-click the App Service.
+5. Select **Deploy to Web App...**.
+6. Deploy the `application/single_app` folder.
+
+This is the same deployment mechanism used for an initial native Python deployment. It is also a valid upgrade method.
+
+#### Option 2: Azure CLI ZIP Deploy
+
+Use this when you want a repeatable manual package-and-deploy flow.
+
+1. Create a deployment ZIP from the required application contents.
+2. Build that ZIP from inside `application/single_app` so the deployed package contains the app files directly.
+3. Confirm `SCM_DO_BUILD_DURING_DEPLOYMENT=true` in App Service configuration.
+4. Deploy the ZIP with Azure CLI:
+
+```bash
+az webapp deploy \
+ --resource-group <resource-group-name> \
+ --name <app-service-name> \
+ --src-path ../deployment.zip \
+ --type zip
+```
+
+This is an upgrade path, not only an initial deployment path. Package the new version, deploy the ZIP, and validate the Startup command before closing the change.
+
+#### Option 3: Deployment Slots for Production
+
+Use deployment slots when you want staged validation and rollback capability for native Python deployments.
+
+Recommended flow:
+
+1. Deploy the updated code to a staging slot.
+2. Validate the staging slot URL.
+3. Swap staging into production.
+4. Roll back with another swap if needed.
+
+### Native Python References
+
+- [Manual setup instructions](../setup_instructions_manual.md)
+- [Manual deployment notes](../reference/deploy/manual_deploy.md)
+
+## Container-Based App Service Upgrades
+
+This path applies to the repo-provided `azd`, Bicep, Terraform, and Azure CLI deployers. These deployers run SimpleChat as a **container** on Azure App Service.
+
+### Important Runtime Rule
+
+For container-based deployments, do **not** add a native Python App Service Startup command. Gunicorn is started by the container entrypoint in `application/single_app/Dockerfile`.
+
+### Upgrade Decision Guide
+
+| Situation | Recommended action | Why |
+| :--- | :--- | :--- |
+| **Application code change only** | `azd deploy` | Updates the app without treating the release like a full infrastructure event |
+| **Infrastructure change only** | `azd provision` | Applies Azure resource/configuration changes without redeploying the app container |
+| **Application code and infrastructure changed together** | `azd up` | Runs the combined app + infrastructure workflow |
+| **You are considering `azd down --purge` for a normal release** | Avoid this for routine upgrades | This is destructive and not a standard upgrade path |
+
+### Recommended Default for Container Releases
+
+For a normal code release, start with:
+
+```bash
+azd deploy
+```
+
+Do **not** assume `azd up` is required for every upgrade. Use `azd up` only when the release also needs infrastructure updates.
+
+When you are unsure whether infrastructure changes are included, review them first:
+
+```bash
+azd provision --preview
+```
+
+### Advanced Option: ACR/Image-Only Rollout
+
+If your App Service is already configured to pull its image from Azure Container Registry and your goal is to avoid any infrastructure reprovisioning, you can use an image-only rollout.
+
+The repo already contains an image publish workflow:
+
+- [.github/workflows/docker_image_publish.yml](../../.github/workflows/docker_image_publish.yml)
+
+That workflow publishes:
+
+1. A timestamped image tag for rollback-friendly releases.
+2. A `latest` tag for the current build.
+
+Use this path when your operations model is:
+
+1. Build and push the updated image to ACR.
+2. Refresh App Service to use the new image tag, or restart it if your container configuration intentionally tracks `latest`.
+3. Roll back by moving App Service back to the prior known-good tag.
+
+This is an **advanced operational option**, not the default repo deployment workflow. It exists specifically for teams that want to update the container image without treating every release like a provisioning event.
+
+### Container Upgrade References
+
+- [AZD deployment guide](../reference/deploy/azd-cli_deploy.md)
+- [Bicep deployment guide](../../deployers/bicep/README.md)
+- [Terraform deployment guide](../../deployers/terraform/ReadMe.md)
+
+## Summary
+
+- Native Python App Service upgrades: validate the Startup command, then use VS Code deploy, ZIP deploy, or deployment slots.
+- Container-based upgrades: prefer `azd deploy` for code-only changes and reserve `azd up` for releases that also change infrastructure.
+- If you already operate App Service against ACR and want lower-touch rollouts, use an image-only update process instead of full reprovisioning.
\ No newline at end of file
diff --git a/docs/reference/deploy/azd-cli_deploy.md b/docs/reference/deploy/azd-cli_deploy.md
index 1e2a2a19..fa95181c 100644
--- a/docs/reference/deploy/azd-cli_deploy.md
+++ b/docs/reference/deploy/azd-cli_deploy.md
@@ -30,6 +30,17 @@ Azure Developer CLI (azd) provides the fastest and most automated way to deploy
## Quick Start
+## Runtime Startup Behavior
+
+- The current `azd` deployment path in this repo is a **container-based App Service** deployment.
+- Gunicorn is started by the container entrypoint in `application/single_app/Dockerfile`.
+- You do **not** need to populate App Service Stack Settings Startup command when deploying through this `azd` path.
+- If you later switch to native Python App Service instead, deploy the `application/single_app` folder and use this startup command:
+
+```bash
+python -m gunicorn -c gunicorn.conf.py app:app
+```
+
### 1. Clone Repository
```bash
git clone https://github.com/microsoft/simplechat.git
@@ -265,6 +276,18 @@ azd up
## Management Commands
+### Upgrade Decision Guide
+
+Use the command that matches the type of change you are making.
+
+| If you changed... | Use | Why |
+| :--- | :--- | :--- |
+| **Application code only** | `azd deploy` | Recommended default for routine container upgrades |
+| **Infrastructure only** | `azd provision` | Updates Azure resources without treating the release like a full app deployment |
+| **Application code and infrastructure together** | `azd up` | Runs the combined deployment flow |
+
+Do **not** assume `azd up` is required for every release. For normal code-only container updates, start with `azd deploy`.
+
### Application Lifecycle
**Deploy application updates:**
@@ -272,17 +295,23 @@ azd up
azd deploy
```
+Recommended for routine container-based application upgrades when infrastructure is unchanged.
+
**Provision infrastructure changes:**
```bash
azd provision
```
+Use `azd provision --preview` first when you want to review infrastructure impact before applying it.
+
**Full redeployment:**
```bash
azd down --purge
azd up
```
+Do not use this as a standard upgrade flow. This is a destructive reprovisioning path.
+
### Environment Management
**List environments:**
diff --git a/docs/reference/deploy/manual_deploy.md b/docs/reference/deploy/manual_deploy.md
index e69de29b..d8173d57 100644
--- a/docs/reference/deploy/manual_deploy.md
+++ b/docs/reference/deploy/manual_deploy.md
@@ -0,0 +1,57 @@
+# Manual Deployment Notes
+
+Use this path when deploying SimpleChat to **native Python Azure App Service** instead of the repo's container-based deployers.
+
+For the combined native-vs-container decision guide, see [../../how-to/upgrade_paths.md](../../how-to/upgrade_paths.md).
+
+## Native Python App Service Startup Command
+
+Set the App Service Stack Settings Startup command explicitly.
+
+Do **not** leave the Startup command empty during an upgrade. Validate it before or during the release.
+
+Deploy and run the `application/single_app` folder in App Service.
+
+Use this Startup command:
+
+```bash
+python -m gunicorn -c gunicorn.conf.py app:app
+```
+
+## Native Python Upgrade Checklist
+
+Use this checklist when updating an existing native Python App Service deployment.
+
+1. Confirm the deployment model is **native Python App Service**, not container-based App Service.
+2. Confirm the `application/single_app` folder is the deployment unit and the Startup command is present and correct.
+3. Choose an upgrade method:
+ - **VS Code deployment** when you want the simplest manual update path.
+ - **Azure CLI ZIP deploy** when you want a repeatable package-and-deploy path.
+ - **Deployment slots** when you want validation and rollback for production.
+4. If you use ZIP deploy, confirm `SCM_DO_BUILD_DURING_DEPLOYMENT=true` so App Service installs dependencies from `requirements.txt`.
+5. Validate the site after deployment.
+
+## Native Python Upgrade Methods
+
+### Visual Studio Code Deployment
+
+Deploy the updated code from VS Code by right-clicking the existing App Service and selecting **Deploy to Web App...**.
+
+### Azure CLI ZIP Deploy
+
+Package the updated application into a deployment ZIP, then deploy it:
+
+```bash
+az webapp deploy \
+ --resource-group <resource-group-name> \
+ --name <app-service-name> \
+ --src-path ../deployment.zip \
+ --type zip
+```
+
+This is an upgrade method, not only an initial deployment method.
+
+## Important Distinction
+
+- Native Python App Service needs the Startup command above.
+- The repo-provided `azd`, Bicep, Terraform, and Azure CLI deployers do not need this because they deploy a container image whose entrypoint already launches Gunicorn.
diff --git a/docs/reference/deploy/terraform_deploy.md b/docs/reference/deploy/terraform_deploy.md
index e69de29b..8b449694 100644
--- a/docs/reference/deploy/terraform_deploy.md
+++ b/docs/reference/deploy/terraform_deploy.md
@@ -0,0 +1,17 @@
+# Terraform Deployment Notes
+
+The current Terraform deployer in this repo provisions a **container-based Azure Linux Web App**.
+
+## Current Behavior
+
+- Terraform sets the App Service to run the published container image.
+- Gunicorn startup is already handled by the container entrypoint in `application/single_app/Dockerfile`.
+- You do **not** need to configure App Service Stack Settings Startup command for the current Terraform deployment.
+
+## If You Switch Terraform to Native Python Later
+
+If you change the Terraform deployment model away from containers and into native Python App Service, deploy the `application/single_app` folder and use this Startup command:
+
+```bash
+python -m gunicorn -c gunicorn.conf.py app:app
+```
diff --git a/docs/setup_instructions.md b/docs/setup_instructions.md
index 199ffa5f..560a7442 100644
--- a/docs/setup_instructions.md
+++ b/docs/setup_instructions.md
@@ -20,6 +20,7 @@ The options are:
- [Azure CLI with Powershell](#azure-cli-with-powershell)
- [BICEP](#bicep)
- [Terraform](#hashicorp-terraform)
+- [Upgrade Existing Deployments](#upgrade-existing-deployments)
**Note:** Terraform is the most robust and requires the least manual post-deployment actions at this time.
@@ -35,6 +36,12 @@ This is the step by step process required to deploy the infrastructure and confi
[Link to manual deployment steps](./setup_instructions_manual.md)
+## Upgrade Existing Deployments
+
+If you already have Simple Chat deployed and only need to update the application, use the dedicated upgrade guide instead of rerunning the full setup flow.
+
+[Link to upgrade paths](./how-to/upgrade_paths.md)
+
## Azure CLI with Powershell
All Azure resource provisioning happens with Azure CLI. Powershell is used for the control flow of the script only.
diff --git a/docs/setup_instructions_manual.md b/docs/setup_instructions_manual.md
index c7e13ca3..92e9421e 100644
--- a/docs/setup_instructions_manual.md
+++ b/docs/setup_instructions_manual.md
@@ -471,7 +471,7 @@ Deploy the application code from your local repository to the Azure App Service.
- Expand **App Service**, find your subscription and the App Service instance you created.
- **Right-click** on the App Service name.
- Select **Deploy to Web App...**.
- - Browse and select the folder containing the application code (the root folder you cloned, e.g., SimpleChat).
+ - Browse and select the `application/single_app` folder from the repository.
- VS Code will prompt to confirm the deployment, potentially warning about overwriting existing content. Click **Deploy**.
- Make sure your requirements.txt file is up-to-date before deploying. The deployment process (SCM_DO_BUILD_DURING_DEPLOYMENT=true) will use this file to install dependencies on the App Service.
- Monitor the deployment progress in the VS Code Output window.
@@ -484,7 +484,7 @@ This method involves creating a zip file of the application code and uploading i
1. **Create the ZIP file**:
- - Navigate into the application's root directory (e.g., SimpleChat) in your terminal.
+ - Navigate into `application/single_app` in your terminal.
- Create a zip file containing **only** the necessary application files and folders. **Crucially, zip the contents, not the parent folder itself.**
- **Include**:
- static/ folder
@@ -529,6 +529,20 @@ This method involves creating a zip file of the application code and uploading i
> Return to top
+This section covers **native Python Azure App Service** upgrades for the manual deployment path.
+
+Before upgrading a native Python deployment, confirm that the App Service Stack Settings Startup command is set correctly and is not blank.
+
+Deploy and run the `application/single_app` folder in App Service.
+
+Use this Startup command:
+
+```bash
+python -m gunicorn -c gunicorn.conf.py app:app
+```
+
+For a shorter decision guide that also covers container-based upgrades, see [Upgrade Paths](./how-to/upgrade_paths.md).
+
Keeping your Simple Chat application up-to-date involves deploying the newer version of the code. Using **Deployment Slots** is the recommended approach for production environments to ensure zero downtime and provide easy rollback capabilities.

diff --git a/functional_tests/test_access_denied_message_feature.py b/functional_tests/test_access_denied_message_feature.py
new file mode 100644
index 00000000..4a0a2194
--- /dev/null
+++ b/functional_tests/test_access_denied_message_feature.py
@@ -0,0 +1,197 @@
+#!/usr/bin/env python3
+# test_access_denied_message_feature.py
+"""
+Functional regression test for admin-configurable access denied message.
+
+Version: 0.239.002
+Implemented in: 0.239.002
+
+This test ensures that:
+1. The Admin Settings template exposes a textarea with name="access_denied_message".
+2. route_frontend_admin_settings.py reads the field from form_data and falls back
+ to the existing stored value (not '') when the field is absent -- preventing
+ silent data loss from cached/older form submissions.
+3. index.html renders app_settings.access_denied_message through the nl2br filter
+ without a redundant hardcoded fallback string.
+4. functions_settings.py defines a non-empty default for access_denied_message so
+ the field is always present after get_settings() deep-merges defaults.
+"""
+
+import sys
+import os
+import re
+
+# Resolve paths relative to repo root
+REPO_ROOT = os.path.dirname(os.path.dirname(os.path.abspath(__file__)))
+
+ADMIN_TEMPLATE = os.path.join(REPO_ROOT, "application", "single_app", "templates", "admin_settings.html")
+INDEX_TEMPLATE = os.path.join(REPO_ROOT, "application", "single_app", "templates", "index.html")
+ROUTE_FILE = os.path.join(REPO_ROOT, "application", "single_app", "route_frontend_admin_settings.py")
+SETTINGS_FILE = os.path.join(REPO_ROOT, "application", "single_app", "functions_settings.py")
+
+
+# ---------------------------------------------------------------------------
+# Test 1 – Admin Settings template has the access_denied_message field
+# ---------------------------------------------------------------------------
+
+def test_admin_template_has_field():
+ """Admin Settings template must expose a textarea named access_denied_message."""
+ print("Testing admin_settings.html contains access_denied_message field...")
+ errors = []
+
+ with open(ADMIN_TEMPLATE, encoding="utf-8") as f:
+ content = f.read()
+
+ # textarea with correct name attribute
+ if 'name="access_denied_message"' not in content:
+ errors.append("No