Joseph Pollack committed on
Commit f5a06d4 · 1 Parent(s): 85f2fd9

Attempts to fix web search: adds Serper support, new tools, and an adapter; resolves a settings issue; plus assorted related changes.

SERPER_WEBSEARCH_IMPLEMENTATION_PLAN.md ADDED
@@ -0,0 +1,396 @@
1
+ # SERPER Web Search Implementation Plan
2
+
3
+ ## Executive Summary
4
+
5
+ This plan details the implementation of SERPER-based web search by vendoring code from `folder/tools/web_search.py` into `src/tools/`, creating a protocol-compliant `SerperWebSearchTool`, fixing the existing `WebSearchTool`, and integrating both into the main search flow.
6
+
7
+ ## Project Structure
8
+
9
+ ### Project 1: Vendor and Refactor Core Web Search Components
10
+ **Goal**: Extract and vendor Serper/SearchXNG search logic from `folder/tools/web_search.py` into `src/tools/`
11
+
12
+ ### Project 2: Create Protocol-Compliant SerperWebSearchTool
13
+ **Goal**: Implement `SerperWebSearchTool` class that fully complies with `SearchTool` protocol
14
+
15
+ ### Project 3: Fix Existing WebSearchTool Protocol Compliance
16
+ **Goal**: Make existing `WebSearchTool` (DuckDuckGo) protocol-compliant
17
+
18
+ ### Project 4: Integrate Web Search into SearchHandler
19
+ **Goal**: Add web search tools to main search flow in `src/app.py`
20
+
21
+ ### Project 5: Update Callers and Dependencies
22
+ **Goal**: Update all code that uses web search to work with new implementation
23
+
24
+ ### Project 6: Testing and Validation
25
+ **Goal**: Add comprehensive tests for all web search implementations
26
+
27
+ ---
28
+
29
+ ## Detailed Implementation Plan
30
+
31
+ ### PROJECT 1: Vendor and Refactor Core Web Search Components
32
+
33
+ #### Activity 1.1: Create Vendor Module Structure
34
+ **File**: `src/tools/vendored/__init__.py`
35
+ - **Task 1.1.1**: Create `src/tools/vendored/` directory
36
+ - **Task 1.1.2**: Create `__init__.py` with exports
37
+
38
+ **File**: `src/tools/vendored/web_search_core.py`
39
+ - **Task 1.1.3**: Vendor `ScrapeResult`, `WebpageSnippet`, `SearchResults` models from `folder/tools/web_search.py` (lines 23-37)
40
+ - **Task 1.1.4**: Vendor `scrape_urls()` function (lines 274-299)
41
+ - **Task 1.1.5**: Vendor `fetch_and_process_url()` function (lines 302-348)
42
+ - **Task 1.1.6**: Vendor `html_to_text()` function (lines 351-368)
43
+ - **Task 1.1.7**: Vendor `is_valid_url()` function (lines 371-410)
44
+ - **Task 1.1.8**: Vendor `ssl_context` setup (lines 115-120)
45
+ - **Task 1.1.9**: Add imports: `aiohttp`, `asyncio`, `BeautifulSoup`, `ssl`
46
+ - **Task 1.1.10**: Add `CONTENT_LENGTH_LIMIT = 10000` constant
47
+ - **Task 1.1.11**: Add type hints following project standards
48
+ - **Task 1.1.12**: Add structlog logging
49
+ - **Task 1.1.13**: Replace `print()` statements with `logger` calls
50
+
51
+ **File**: `src/tools/vendored/serper_client.py`
52
+ - **Task 1.1.14**: Vendor `SerperClient` class from `folder/tools/web_search.py` (lines 123-196)
53
+ - **Task 1.1.15**: Remove dependency on `ResearchAgent` and `ResearchRunner`
54
+ - **Task 1.1.16**: Replace filter agent with simple relevance filtering or remove it
55
+ - **Task 1.1.17**: Add `__init__` that takes `api_key: str | None` parameter
56
+ - **Task 1.1.18**: Update `search()` method to return `list[WebpageSnippet]` without filtering
57
+ - **Task 1.1.19**: Remove `_filter_results()` method (or make it optional)
58
+ - **Task 1.1.20**: Add error handling with `SearchError` and `RateLimitError`
59
+ - **Task 1.1.21**: Add structlog logging
60
+ - **Task 1.1.22**: Add type hints
61
+
62
+ **File**: `src/tools/vendored/searchxng_client.py`
63
+ - **Task 1.1.23**: Vendor `SearchXNGClient` class from `folder/tools/web_search.py` (lines 199-271)
64
+ - **Task 1.1.24**: Remove dependency on `ResearchAgent` and `ResearchRunner`
65
+ - **Task 1.1.25**: Replace filter agent with simple relevance filtering or remove it
66
+ - **Task 1.1.26**: Add `__init__` that takes `host: str` parameter
67
+ - **Task 1.1.27**: Update `search()` method to return `list[WebpageSnippet]` without filtering
68
+ - **Task 1.1.28**: Remove `_filter_results()` method (or make it optional)
69
+ - **Task 1.1.29**: Add error handling with `SearchError` and `RateLimitError`
70
+ - **Task 1.1.30**: Add structlog logging
71
+ - **Task 1.1.31**: Add type hints
72
+
73
+ #### Activity 1.2: Create Rate Limiting for Web Search
74
+ **File**: `src/tools/rate_limiter.py`
75
+ - **Task 1.2.1**: Add `get_serper_limiter()` function (rate: "10/second" with API key)
76
+ - **Task 1.2.2**: Add `get_searchxng_limiter()` function (rate: "5/second")
77
+ - **Task 1.2.3**: Use `RateLimiterFactory.get()` pattern
78
+
79
+ ---
80
+
81
+ ### PROJECT 2: Create Protocol-Compliant SerperWebSearchTool
82
+
83
+ #### Activity 2.1: Implement SerperWebSearchTool Class
84
+ **File**: `src/tools/serper_web_search.py`
85
+ - **Task 2.1.1**: Create new file `src/tools/serper_web_search.py`
86
+ - **Task 2.1.2**: Add imports:
87
+ - `from src.tools.base import SearchTool`
88
+ - `from src.tools.vendored.serper_client import SerperClient`
89
+ - `from src.tools.vendored.web_search_core import scrape_urls, WebpageSnippet`
90
+ - `from src.tools.rate_limiter import get_serper_limiter`
91
+ - `from src.tools.query_utils import preprocess_query`
92
+ - `from src.utils.config import settings`
93
+ - `from src.utils.exceptions import SearchError, RateLimitError`
94
+ - `from src.utils.models import Citation, Evidence`
95
+ - `import structlog`
96
+ - `from tenacity import retry, stop_after_attempt, wait_exponential`
97
+
98
+ - **Task 2.1.3**: Create `SerperWebSearchTool` class
99
+ - **Task 2.1.4**: Add `__init__(self, api_key: str | None = None)` method
100
+ - Line 2.1.4.1: Get API key from parameter or `settings.serper_api_key`
101
+ - Line 2.1.4.2: Validate API key is not None, raise `ConfigurationError` if missing
102
+ - Line 2.1.4.3: Initialize `SerperClient(api_key=self.api_key)`
103
+ - Line 2.1.4.4: Get rate limiter: `self._limiter = get_serper_limiter(self.api_key)`
104
+
105
+ - **Task 2.1.5**: Add `@property def name(self) -> str:` returning `"serper"`
106
+
107
+ - **Task 2.1.6**: Add `async def _rate_limit(self) -> None:` method
108
+ - Line 2.1.6.1: Call `await self._limiter.acquire()`
109
+
110
+ - **Task 2.1.7**: Add `@retry(...)` decorator with exponential backoff
111
+
112
+ - **Task 2.1.8**: Add `async def search(self, query: str, max_results: int = 10) -> list[Evidence]:` method
113
+ - Line 2.1.8.1: Call `await self._rate_limit()`
114
+ - Line 2.1.8.2: Preprocess query: `clean_query = preprocess_query(query)`
115
+ - Line 2.1.8.3: Use `clean_query if clean_query else query`
116
+ - Line 2.1.8.4: Call `search_results = await self._client.search(query, filter_for_relevance=False, max_results=max_results)`
117
+ - Line 2.1.8.5: Call `scraped = await scrape_urls(search_results)`
118
+ - Line 2.1.8.6: Convert `ScrapeResult` to `Evidence` objects:
119
+ - Line 2.1.8.6.1: Create `Citation` with `title`, `url`, `source="serper"`, `date="Unknown"`, `authors=[]`
120
+ - Line 2.1.8.6.2: Create `Evidence` with `content=scraped.text`, `citation`, `relevance=0.0`
121
+ - Line 2.1.8.7: Return `list[Evidence]`
122
+ - Line 2.1.8.8: Add try/except for `httpx.HTTPStatusError`:
123
+ - Line 2.1.8.8.1: Check for 429 status, raise `RateLimitError`
124
+ - Line 2.1.8.8.2: Otherwise raise `SearchError`
125
+ - Line 2.1.8.9: Add try/except for `httpx.TimeoutException`, raise `SearchError`
126
+ - Line 2.1.8.10: Add generic exception handler, log and raise `SearchError`
127
+
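+ For reference, "protocol-compliant" in Projects 2-3 means satisfying the `SearchTool` protocol from `src/tools/base.py`. The sketch below is an assumption about that protocol's shape, inferred from how it is used in this plan (a `name` property plus an async `search()` returning `list[Evidence]`); the real definition may differ in details.
+
+ ```python
+ # Assumed shape of the SearchTool protocol (src/tools/base.py).
+ # Inferred from this plan, not copied from the actual source.
+ from typing import Protocol, runtime_checkable
+
+ from src.utils.models import Evidence
+
+
+ @runtime_checkable
+ class SearchTool(Protocol):
+     @property
+     def name(self) -> str: ...
+
+     async def search(self, query: str, max_results: int = 10) -> list[Evidence]: ...
+ ```
+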
128
+ #### Activity 2.2: Implement SearchXNGWebSearchTool Class
129
+ **File**: `src/tools/searchxng_web_search.py`
130
+ - **Task 2.2.1**: Create new file `src/tools/searchxng_web_search.py`
131
+ - **Task 2.2.2**: Add imports (similar to SerperWebSearchTool)
132
+ - **Task 2.2.3**: Create `SearchXNGWebSearchTool` class
133
+ - **Task 2.2.4**: Add `__init__(self, host: str | None = None)` method
134
+ - Line 2.2.4.1: Get host from parameter or `settings.searchxng_host`
135
+ - Line 2.2.4.2: Validate host is not None, raise `ConfigurationError` if missing
136
+ - Line 2.2.4.3: Initialize `SearchXNGClient(host=self.host)`
137
+ - Line 2.2.4.4: Get rate limiter: `self._limiter = get_searchxng_limiter()`
138
+
139
+ - **Task 2.2.5**: Add `@property def name(self) -> str:` returning `"searchxng"`
140
+
141
+ - **Task 2.2.6**: Add `async def _rate_limit(self) -> None:` method
142
+
143
+ - **Task 2.2.7**: Add `@retry(...)` decorator
144
+
145
+ - **Task 2.2.8**: Add `async def search(self, query: str, max_results: int = 10) -> list[Evidence]:` method
146
+ - Line 2.2.8.1-2.2.8.10: Similar structure to SerperWebSearchTool
147
+
148
+ ---
149
+
150
+ ### PROJECT 3: Fix Existing WebSearchTool Protocol Compliance
151
+
152
+ #### Activity 3.1: Update WebSearchTool Class
153
+ **File**: `src/tools/web_search.py`
154
+ - **Task 3.1.1**: Add `@property def name(self) -> str:` method returning `"duckduckgo"` (after line 17)
155
+
156
+ - **Task 3.1.2**: Change `search()` return type from `SearchResult` to `list[Evidence]` (line 19)
157
+
158
+ - **Task 3.1.3**: Update `search()` method body:
159
+ - Line 3.1.3.1: Keep existing search logic (lines 21-43)
160
+ - Line 3.1.3.2: Instead of returning `SearchResult`, return `evidence` list directly (line 44)
161
+ - Line 3.1.3.3: Update exception handler to return empty list `[]` instead of `SearchResult` (line 51)
162
+
163
+ - **Task 3.1.4**: Add imports if needed:
164
+ - Line 3.1.4.1: `from src.utils.exceptions import SearchError`
165
+ - Line 3.1.4.2: Update exception handling to raise `SearchError` instead of returning error `SearchResult`
166
+
167
+ - **Task 3.1.5**: Add query preprocessing:
168
+ - Line 3.1.5.1: Import `from src.tools.query_utils import preprocess_query`
169
+ - Line 3.1.5.2: Add `clean_query = preprocess_query(query)` before search
170
+ - Line 3.1.5.3: Use `clean_query if clean_query else query`
171
+
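+ A minimal sketch of what Activity 3.1 might produce, assuming the existing tool wraps `duckduckgo_search.DDGS` (the tests in Project 6 mock DDGS); the result keys (`title`, `href`, `body`) follow that library's `text()` output, and `preprocess_query` is the helper named above.
+
+ ```python
+ # Hedged sketch of a protocol-compliant WebSearchTool (DuckDuckGo backend).
+ import asyncio
+
+ import structlog
+ from duckduckgo_search import DDGS
+
+ from src.tools.query_utils import preprocess_query
+ from src.utils.exceptions import SearchError
+ from src.utils.models import Citation, Evidence
+
+ logger = structlog.get_logger()
+
+
+ class WebSearchTool:
+     """DuckDuckGo web search returning list[Evidence] (SearchTool-compliant)."""
+
+     @property
+     def name(self) -> str:
+         return "duckduckgo"
+
+     async def search(self, query: str, max_results: int = 10) -> list[Evidence]:
+         clean_query = preprocess_query(query)
+         final_query = clean_query if clean_query else query
+         try:
+             # DDGS is synchronous, so run it off the event loop
+             raw = await asyncio.to_thread(
+                 lambda: list(DDGS().text(final_query, max_results=max_results))
+             )
+         except Exception as e:
+             logger.error("DuckDuckGo search failed", error=str(e), query=final_query)
+             raise SearchError(f"DuckDuckGo search failed: {e}") from e
+
+         return [
+             Evidence(
+                 content=r.get("body", ""),
+                 citation=Citation(
+                     title=r.get("title", ""),
+                     url=r.get("href", ""),
+                     source="duckduckgo",
+                     date="Unknown",
+                     authors=[],
+                 ),
+                 relevance=0.0,
+             )
+             for r in raw
+         ]
+ ```
+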
172
+ #### Activity 3.2: Update Retrieval Agent Caller
173
+ **File**: `src/agents/retrieval_agent.py`
174
+ - **Task 3.2.1**: Update `search_web()` function (line 31):
175
+ - Line 3.2.1.1: Change `results = await _web_search.search(query, max_results)`
176
+ - Line 3.2.1.2: Change to `evidence = await _web_search.search(query, max_results)`
177
+ - Line 3.2.1.3: Update check: `if not evidence:` instead of `if not results.evidence:`
178
+ - Line 3.2.1.4: Update state update: `new_count = state.add_evidence(evidence)` instead of `results.evidence`
179
+ - Line 3.2.1.5: Update logging: `results_found=len(evidence)` instead of `len(results.evidence)`
180
+ - Line 3.2.1.6: Update output formatting: `for i, r in enumerate(evidence[:max_results], 1):` instead of `results.evidence[:max_results]`
181
+ - Line 3.2.1.7: Update deduplication: `await state.embedding_service.deduplicate(evidence)` instead of `results.evidence`
182
+ - Line 3.2.1.8: Update output message: `Found {len(evidence)} web results` instead of `len(results.evidence)`
183
+
184
+ ---
185
+
186
+ ### PROJECT 4: Integrate Web Search into SearchHandler
187
+
188
+ #### Activity 4.1: Create Web Search Tool Factory
189
+ **File**: `src/tools/web_search_factory.py`
190
+ - **Task 4.1.1**: Create new file `src/tools/web_search_factory.py`
191
+ - **Task 4.1.2**: Add imports:
192
+ - `from src.tools.web_search import WebSearchTool`
193
+ - `from src.tools.serper_web_search import SerperWebSearchTool`
194
+ - `from src.tools.searchxng_web_search import SearchXNGWebSearchTool`
195
+ - `from src.utils.config import settings`
196
+ - `from src.utils.exceptions import ConfigurationError`
197
+ - `import structlog`
198
+
199
+ - **Task 4.1.3**: Add `logger = structlog.get_logger()`
200
+
201
+ - **Task 4.1.4**: Create `def create_web_search_tool() -> SearchTool | None:` function (see the sketch after this list)
202
+ - Line 4.1.4.1: Check `settings.web_search_provider`
203
+ - Line 4.1.4.2: If `"serper"`:
204
+ - Line 4.1.4.2.1: Check `settings.serper_api_key` or `settings.web_search_available()`
205
+ - Line 4.1.4.2.2: If available, return `SerperWebSearchTool()`
206
+ - Line 4.1.4.2.3: Else log warning and return `None`
207
+ - Line 4.1.4.3: If `"searchxng"`:
208
+ - Line 4.1.4.3.1: Check `settings.searchxng_host` or `settings.web_search_available()`
209
+ - Line 4.1.4.3.2: If available, return `SearchXNGWebSearchTool()`
210
+ - Line 4.1.4.3.3: Else log warning and return `None`
211
+ - Line 4.1.4.4: If `"duckduckgo"`:
212
+ - Line 4.1.4.4.1: Return `WebSearchTool()` (always available)
213
+ - Line 4.1.4.5: If `"brave"` or `"tavily"`:
214
+ - Line 4.1.4.5.1: Log warning "Not yet implemented"
215
+ - Line 4.1.4.5.2: Return `None`
216
+ - Line 4.1.4.6: Default: return `WebSearchTool()` (fallback to DuckDuckGo)
217
+
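+ Task 4.1.4 could look roughly like the following. The settings attribute names are the ones listed under Configuration Requirements; treat the sketch as illustrative rather than final.
+
+ ```python
+ # Hedged sketch of src/tools/web_search_factory.py (Task 4.1.4).
+ import structlog
+
+ from src.tools.base import SearchTool
+ from src.tools.searchxng_web_search import SearchXNGWebSearchTool
+ from src.tools.serper_web_search import SerperWebSearchTool
+ from src.tools.web_search import WebSearchTool
+ from src.utils.config import settings
+
+ logger = structlog.get_logger()
+
+
+ def create_web_search_tool() -> SearchTool | None:
+     """Pick a web search tool based on settings.web_search_provider."""
+     provider = settings.web_search_provider
+
+     if provider == "serper":
+         if settings.serper_api_key:
+             return SerperWebSearchTool()
+         logger.warning("SERPER_API_KEY not set; web search disabled")
+         return None
+
+     if provider == "searchxng":
+         if settings.searchxng_host:
+             return SearchXNGWebSearchTool()
+         logger.warning("SEARCHXNG_HOST not set; web search disabled")
+         return None
+
+     if provider in ("brave", "tavily"):
+         logger.warning("Web search provider not yet implemented", provider=provider)
+         return None
+
+     # "duckduckgo" and any unknown value fall back to the keyless tool
+     return WebSearchTool()
+ ```
+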
218
+ #### Activity 4.2: Update SearchHandler Initialization
219
+ **File**: `src/app.py`
220
+ - **Task 4.2.1**: Add import: `from src.tools.web_search_factory import create_web_search_tool`
221
+
222
+ - **Task 4.2.2**: Update `configure_orchestrator()` function (around line 73):
223
+ - Line 4.2.2.1: Before creating `SearchHandler`, call `web_search_tool = create_web_search_tool()`
224
+ - Line 4.2.2.2: Create tools list: `tools = [PubMedTool(), ClinicalTrialsTool(), EuropePMCTool()]`
225
+ - Line 4.2.2.3: If `web_search_tool is not None`:
226
+ - Line 4.2.2.3.1: Append `web_search_tool` to tools list
227
+ - Line 4.2.2.3.2: Log info: "Web search tool added to search handler"
228
+ - Line 4.2.2.4: Update `SearchHandler` initialization to use `tools` list
229
+
230
+ ---
231
+
232
+ ### PROJECT 5: Update Callers and Dependencies
233
+
234
+ #### Activity 5.1: Update web_search_adapter
235
+ **File**: `src/tools/web_search_adapter.py`
236
+ - **Task 5.1.1**: Update `web_search()` function to use the new implementation (sketched after this list):
237
+ - Line 5.1.1.1: Import `from src.tools.web_search_factory import create_web_search_tool`
238
+ - Line 5.1.1.2: Remove dependency on `folder.tools.web_search`
239
+ - Line 5.1.1.3: Get tool: `tool = create_web_search_tool()`
240
+ - Line 5.1.1.4: If `tool is None`, return error message
241
+ - Line 5.1.1.5: Call `evidence = await tool.search(query, max_results=5)`
242
+ - Line 5.1.1.6: Convert `Evidence` objects to formatted string:
243
+ - Line 5.1.1.6.1: Format each evidence with title, URL, content preview
244
+ - Line 5.1.1.7: Return formatted string
245
+
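+ Activity 5.1 might end up roughly like this. The adapter's signature (an async `web_search(query) -> str`) is assumed from how `tool_executor.py` and `planner_agent.py` call it; adjust to the real call sites.
+
+ ```python
+ # Hedged sketch of the updated src/tools/web_search_adapter.py (Activity 5.1).
+ from src.tools.web_search_factory import create_web_search_tool
+
+
+ async def web_search(query: str, max_results: int = 5) -> str:
+     """Run a web search and return a formatted text summary for LLM consumption."""
+     tool = create_web_search_tool()
+     if tool is None:
+         return "Web search is not configured (no provider available)."
+
+     evidence = await tool.search(query, max_results=max_results)
+     if not evidence:
+         return f"No web results found for: {query}"
+
+     lines = [f"Web results for '{query}':"]
+     for i, ev in enumerate(evidence, 1):
+         preview = ev.content[:300].replace("\n", " ")
+         lines.append(f"{i}. {ev.citation.title} ({ev.citation.url})\n   {preview}...")
+     return "\n".join(lines)
+ ```
+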
246
+ #### Activity 5.2: Update Tool Executor
247
+ **File**: `src/tools/tool_executor.py`
248
+ - **Task 5.2.1**: Verify `web_search_adapter.web_search()` usage (line 86) still works
249
+ - **Task 5.2.2**: No changes needed if adapter is updated correctly
250
+
251
+ #### Activity 5.3: Update Planner Agent
252
+ **File**: `src/orchestrator/planner_agent.py`
253
+ - **Task 5.3.1**: Verify `web_search_adapter.web_search()` usage (line 14) still works
254
+ - **Task 5.3.2**: No changes needed if adapter is updated correctly
255
+
256
+ #### Activity 5.4: Remove Legacy Dependencies
257
+ **File**: `src/tools/web_search_adapter.py`
258
+ - **Task 5.4.1**: Remove import of `folder.llm_config` and `folder.tools.web_search`
259
+ - **Task 5.4.2**: Update error messages to reflect new implementation
260
+
261
+ ---
262
+
263
+ ### PROJECT 6: Testing and Validation
264
+
265
+ #### Activity 6.1: Unit Tests for Vendored Components
266
+ **File**: `tests/unit/tools/test_vendored_web_search_core.py`
267
+ - **Task 6.1.1**: Test `scrape_urls()` function
268
+ - **Task 6.1.2**: Test `fetch_and_process_url()` function
269
+ - **Task 6.1.3**: Test `html_to_text()` function
270
+ - **Task 6.1.4**: Test `is_valid_url()` function
271
+
272
+ **File**: `tests/unit/tools/test_vendored_serper_client.py`
273
+ - **Task 6.1.5**: Mock SerperClient API calls
274
+ - **Task 6.1.6**: Test successful search
275
+ - **Task 6.1.7**: Test error handling
276
+ - **Task 6.1.8**: Test rate limiting
277
+
278
+ **File**: `tests/unit/tools/test_vendored_searchxng_client.py`
279
+ - **Task 6.1.9**: Mock SearchXNGClient API calls
280
+ - **Task 6.1.10**: Test successful search
281
+ - **Task 6.1.11**: Test error handling
282
+ - **Task 6.1.12**: Test rate limiting
283
+
284
+ #### Activity 6.2: Unit Tests for Web Search Tools
285
+ **File**: `tests/unit/tools/test_serper_web_search.py`
286
+ - **Task 6.2.1**: Test `SerperWebSearchTool.__init__()` with valid API key
287
+ - **Task 6.2.2**: Test `SerperWebSearchTool.__init__()` without API key (should raise)
288
+ - **Task 6.2.3**: Test `name` property returns `"serper"`
289
+ - **Task 6.2.4**: Test `search()` returns `list[Evidence]`
290
+ - **Task 6.2.5**: Test `search()` with mocked SerperClient
291
+ - **Task 6.2.6**: Test error handling (SearchError, RateLimitError)
292
+ - **Task 6.2.7**: Test query preprocessing
293
+ - **Task 6.2.8**: Test rate limiting
294
+
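+ A minimal sketch of Tasks 6.2.2-6.2.5, assuming `pytest-asyncio` and the module paths used throughout this plan; the mocked objects follow the vendored `WebpageSnippet`/`ScrapeResult` fields.
+
+ ```python
+ # Hedged sketch for tests/unit/tools/test_serper_web_search.py.
+ from unittest.mock import AsyncMock, patch
+
+ import pytest
+
+ from src.tools.serper_web_search import SerperWebSearchTool
+ from src.tools.vendored.web_search_core import ScrapeResult, WebpageSnippet
+ from src.utils.exceptions import ConfigurationError
+ from src.utils.models import Evidence
+
+
+ def test_init_without_api_key_raises() -> None:
+     # Task 6.2.2: no key via argument or settings should raise ConfigurationError
+     with patch("src.tools.serper_web_search.settings") as mock_settings:
+         mock_settings.serper_api_key = None
+         with pytest.raises(ConfigurationError):
+             SerperWebSearchTool()
+
+
+ @pytest.mark.asyncio
+ async def test_search_returns_evidence_list() -> None:
+     # Tasks 6.2.3-6.2.5: name property plus search() with mocked client and scraper
+     tool = SerperWebSearchTool(api_key="test-key")
+     assert tool.name == "serper"
+
+     snippet = WebpageSnippet(url="https://example.org", title="Example", description="desc")
+     scraped = ScrapeResult(url=snippet.url, title=snippet.title, description="desc", text="body")
+
+     with (
+         patch.object(tool, "_client") as mock_client,
+         patch("src.tools.serper_web_search.scrape_urls", new=AsyncMock(return_value=[scraped])),
+         patch.object(tool, "_rate_limit", new=AsyncMock()),
+     ):
+         mock_client.search = AsyncMock(return_value=[snippet])
+         results = await tool.search("metformin cardiovascular outcomes", max_results=3)
+
+     assert isinstance(results, list)
+     assert all(isinstance(ev, Evidence) for ev in results)
+     assert results[0].citation.source == "serper"
+ ```
+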
295
+ **File**: `tests/unit/tools/test_searchxng_web_search.py`
296
+ - **Task 6.2.9**: Similar tests for SearchXNGWebSearchTool
297
+
298
+ **File**: `tests/unit/tools/test_web_search.py`
299
+ - **Task 6.2.10**: Test `WebSearchTool.name` property returns `"duckduckgo"`
300
+ - **Task 6.2.11**: Test `WebSearchTool.search()` returns `list[Evidence]`
301
+ - **Task 6.2.12**: Test `WebSearchTool.search()` with mocked DDGS
302
+ - **Task 6.2.13**: Test error handling
303
+ - **Task 6.2.14**: Test query preprocessing
304
+
305
+ #### Activity 6.3: Integration Tests
306
+ **File**: `tests/integration/test_web_search_integration.py`
307
+ - **Task 6.3.1**: Test `SerperWebSearchTool` with real API (marked `@pytest.mark.integration`)
308
+ - **Task 6.3.2**: Test `SearchXNGWebSearchTool` with real API (marked `@pytest.mark.integration`)
309
+ - **Task 6.3.3**: Test `WebSearchTool` with real DuckDuckGo (marked `@pytest.mark.integration`)
310
+ - **Task 6.3.4**: Test `create_web_search_tool()` factory function
311
+ - **Task 6.3.5**: Test SearchHandler with web search tool
312
+
313
+ #### Activity 6.4: Update Existing Tests
314
+ **File**: `tests/unit/agents/test_retrieval_agent.py`
315
+ - **Task 6.4.1**: Update tests to expect `list[Evidence]` instead of `SearchResult`
316
+ - **Task 6.4.2**: Mock `WebSearchTool.search()` to return `list[Evidence]`
317
+
318
+ **File**: `tests/unit/tools/test_tool_executor.py`
319
+ - **Task 6.4.3**: Verify tests still pass with updated `web_search_adapter`
320
+
321
+ ---
322
+
323
+ ## Implementation Order
324
+
325
+ 1. **PROJECT 1**: Vendor core components (foundation)
326
+ 2. **PROJECT 3**: Fix existing WebSearchTool (quick win, unblocks retrieval agent)
327
+ 3. **PROJECT 2**: Create SerperWebSearchTool (new functionality)
328
+ 4. **PROJECT 4**: Integrate into SearchHandler (main integration)
329
+ 5. **PROJECT 5**: Update callers (clean up dependencies)
330
+ 6. **PROJECT 6**: Testing (validation)
331
+
332
+ ---
333
+
334
+ ## Dependencies and Prerequisites
335
+
336
+ ### External Dependencies
337
+ - `aiohttp` - Already in requirements
338
+ - `beautifulsoup4` - Already in requirements
339
+ - `duckduckgo-search` - Already in requirements
340
+ - `tenacity` - Already in requirements
341
+ - `structlog` - Already in requirements
342
+
343
+ ### Internal Dependencies
344
+ - `src/tools/base.py` - SearchTool protocol
345
+ - `src/tools/rate_limiter.py` - Rate limiting utilities
346
+ - `src/tools/query_utils.py` - Query preprocessing
347
+ - `src/utils/config.py` - Settings and configuration
348
+ - `src/utils/exceptions.py` - Custom exceptions
349
+ - `src/utils/models.py` - Evidence, Citation models
350
+
351
+ ### Configuration Requirements
352
+ - `SERPER_API_KEY` - For Serper provider
353
+ - `SEARCHXNG_HOST` - For SearchXNG provider
354
+ - `WEB_SEARCH_PROVIDER` - Environment variable (default: "duckduckgo")
355
+
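+ A hedged sketch of how these three settings might surface in `src/utils/config.py`, assuming the existing `Settings` object is pydantic-settings based (field names mirror the env vars above; the actual class layout may differ):
+
+ ```python
+ # Illustrative only: assumed settings fields this plan relies on.
+ from pydantic_settings import BaseSettings
+
+
+ class Settings(BaseSettings):
+     serper_api_key: str | None = None        # SERPER_API_KEY
+     searchxng_host: str | None = None        # SEARCHXNG_HOST
+     web_search_provider: str = "duckduckgo"  # WEB_SEARCH_PROVIDER
+
+     def web_search_available(self) -> bool:
+         """True if the configured provider has the credentials it needs."""
+         if self.web_search_provider == "serper":
+             return bool(self.serper_api_key)
+         if self.web_search_provider == "searchxng":
+             return bool(self.searchxng_host)
+         return True  # DuckDuckGo needs no credentials
+
+
+ settings = Settings()
+ ```
+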
356
+ ---
357
+
358
+ ## Risk Assessment
359
+
360
+ ### High Risk
361
+ - **Breaking changes to retrieval_agent.py**: Must update carefully to handle `list[Evidence]` instead of `SearchResult`
362
+ - **Legacy folder dependencies**: Need to ensure all code is properly vendored
363
+
364
+ ### Medium Risk
365
+ - **Rate limiting**: Serper API may have different limits than expected
366
+ - **Error handling**: Need to handle API failures gracefully
367
+
368
+ ### Low Risk
369
+ - **Query preprocessing**: May need adjustment for web search vs PubMed
370
+ - **Testing**: Integration tests require API keys
371
+
372
+ ---
373
+
374
+ ## Success Criteria
375
+
376
+ 1. ✅ `SerperWebSearchTool` implements `SearchTool` protocol correctly
377
+ 2. ✅ `WebSearchTool` implements `SearchTool` protocol correctly
378
+ 3. ✅ Both tools can be added to `SearchHandler`
379
+ 4. ✅ `web_search_adapter` works with new implementation
380
+ 5. ✅ `retrieval_agent` works with updated `WebSearchTool`
381
+ 6. ✅ All unit tests pass
382
+ 7. ✅ Integration tests pass (with API keys)
383
+ 8. ✅ No dependencies on `folder/tools/web_search.py` in `src/` code
384
+ 9. ✅ Configuration supports multiple providers
385
+ 10. ✅ Error handling is robust
386
+
387
+ ---
388
+
389
+ ## Notes
390
+
391
+ - The vendored code should be self-contained and not depend on `folder/` modules
392
+ - Filter agent functionality from original code is removed (can be added later if needed)
393
+ - Rate limiting follows the same pattern as the PubMed tool
394
+ - Query preprocessing may need web-specific adjustments (less aggressive than PubMed)
395
+ - Consider adding relevance scoring in the future
396
+
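+ To make the query-preprocessing note concrete, a hedged illustration of "less aggressive" preprocessing for web queries; this is a hypothetical helper, not the existing `src/tools/query_utils.preprocess_query`:
+
+ ```python
+ # Illustrative web-query cleanup: drop PubMed-style field tags and boolean noise
+ # while keeping the free text intact. Hypothetical helper, named for clarity only.
+ import re
+
+
+ def preprocess_web_query(query: str) -> str:
+     cleaned = re.sub(r"\[[a-z ]+\]", "", query, flags=re.IGNORECASE)   # e.g. [tiab], [mesh]
+     cleaned = re.sub(r"\b(AND|OR|NOT)\b", " ", cleaned)                # engines treat these as plain words
+     return " ".join(cleaned.split())
+ ```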
requirements.txt CHANGED
@@ -35,9 +35,6 @@ pydantic-graph>=1.22.0
35
  # Web search
36
  duckduckgo-search>=5.0
37
 
38
- # Multi-agent orchestration (Advanced mode)
39
- agent-framework-core>=1.0.0b251120,<2.0.0
40
-
41
  # LlamaIndex RAG
42
  llama-index-llms-huggingface>=0.6.1
43
  llama-index-llms-huggingface-api>=0.6.1
@@ -51,28 +48,23 @@ pillow>=10.0.0 # For image processing
51
 
52
  # TTS dependencies (for Modal GPU TTS)
53
  torch>=2.0.0 # Required by Kokoro TTS
54
- transformers>=4.30.0 # Required by Kokoro TTS
55
  modal>=0.63.0 # Required for TTS GPU execution
56
  # Note: Kokoro is installed in Modal image from: git+https://github.com/hexgrad/kokoro.git
57
 
58
- # Multi-agent orchestration (Advanced mode) - from optional magentic
59
- agent-framework-core>=1.0.0b251120,<2.0.0
60
- llama-index-llms-openai>=0.6.9
61
- llama-index-embeddings-openai>=0.5.1
62
-
63
  # Embeddings & Vector Store
64
  tokenizers>=0.22.0,<=0.23.0
65
- transformers>=4.57.2
66
- chromadb>=0.4.0
67
  rpds-py>=0.29.0 # Python implementation of rpds (required by chromadb on Windows)
 
68
  sentence-transformers>=2.2.0
69
- numpy<2.0
70
 
71
- # Optional: Modal for code execution
72
- modal>=0.63.0
73
 
74
- # LlamaIndex RAG - from optional modal
75
- llama-index-llms-openai
76
- llama-index-embeddings-openai
77
 
78
- pydantic-ai-slim[huggingface]>=0.0.18
 
 
 
35
  # Web search
36
  duckduckgo-search>=5.0
37
 
 
 
 
38
  # LlamaIndex RAG
39
  llama-index-llms-huggingface>=0.6.1
40
  llama-index-llms-huggingface-api>=0.6.1
 
48
 
49
  # TTS dependencies (for Modal GPU TTS)
50
  torch>=2.0.0 # Required by Kokoro TTS
51
+ transformers>=4.57.2 # Required by Kokoro TTS
52
  modal>=0.63.0 # Required for TTS GPU execution
53
  # Note: Kokoro is installed in Modal image from: git+https://github.com/hexgrad/kokoro.git
54
 
 
 
 
 
 
55
  # Embeddings & Vector Store
56
  tokenizers>=0.22.0,<=0.23.0
 
 
57
  rpds-py>=0.29.0 # Python implementation of rpds (required by chromadb on Windows)
58
+ chromadb>=0.4.0
59
  sentence-transformers>=2.2.0
60
+ numpy<2.0 # chromadb compatibility: uses np.float_ removed in NumPy 2.0
61
 
62
+ # Pydantic AI with HuggingFace support
63
+ pydantic-ai-slim[huggingface]>=0.0.18
64
 
65
+ # Multi-agent orchestration (Advanced mode)
66
+ agent-framework-core>=1.0.0b251120,<2.0.0
 
67
 
68
+ # LlamaIndex RAG - OpenAI
69
+ llama-index-llms-openai>=0.6.9
70
+ llama-index-embeddings-openai>=0.5.1
src/agent_factory/judges.py CHANGED
@@ -38,54 +38,27 @@ def get_model(oauth_token: str | None = None) -> Any:
38
  Args:
39
  oauth_token: Optional OAuth token from HuggingFace login (takes priority over env vars)
40
  """
41
- # Priority: oauth_token > env vars
42
  effective_hf_token = oauth_token or settings.hf_token or settings.huggingface_api_key
43
 
44
- # If OAuth token is available, prefer HuggingFace (free tier on Spaces)
45
- if effective_hf_token:
46
- model_name = settings.huggingface_model or "meta-llama/Llama-3.1-8B-Instruct"
47
- hf_provider = HuggingFaceProvider(api_key=effective_hf_token)
48
- logger.info(
49
- "using_huggingface_with_token",
50
- has_oauth=bool(oauth_token),
51
- model=model_name,
52
  )
53
- return HuggingFaceModel(model_name, provider=hf_provider)
54
-
55
- llm_provider = settings.llm_provider
56
-
57
- if llm_provider == "anthropic":
58
- if not settings.anthropic_api_key:
59
- logger.warning("Anthropic provider selected but no API key available, defaulting to HuggingFace")
60
- # Fallback to HuggingFace without token (public models)
61
- model_name = settings.huggingface_model or "meta-llama/Llama-3.1-8B-Instruct"
62
- hf_provider = HuggingFaceProvider(api_key=None)
63
- return HuggingFaceModel(model_name, provider=hf_provider)
64
- provider = AnthropicProvider(api_key=settings.anthropic_api_key)
65
- return AnthropicModel(settings.anthropic_model, provider=provider)
66
-
67
- if llm_provider == "huggingface":
68
- # No token available, use public models
69
- model_name = settings.huggingface_model or "meta-llama/Llama-3.1-8B-Instruct"
70
- hf_provider = HuggingFaceProvider(api_key=None)
71
- return HuggingFaceModel(model_name, provider=hf_provider)
72
-
73
- if llm_provider == "openai":
74
- if not settings.openai_api_key:
75
- logger.warning("OpenAI provider selected but no API key available, defaulting to HuggingFace")
76
- # Fallback to HuggingFace without token (public models)
77
- model_name = settings.huggingface_model or "meta-llama/Llama-3.1-8B-Instruct"
78
- hf_provider = HuggingFaceProvider(api_key=None)
79
- return HuggingFaceModel(model_name, provider=hf_provider)
80
- openai_provider = OpenAIProvider(api_key=settings.openai_api_key)
81
- return OpenAIModel(settings.openai_model, provider=openai_provider)
82
-
83
- # Default to HuggingFace if provider is unknown or not specified
84
- if llm_provider not in ("huggingface", "openai", "anthropic"):
85
- logger.warning("Unknown LLM provider, defaulting to HuggingFace", provider=llm_provider)
86
 
 
87
  model_name = settings.huggingface_model or "meta-llama/Llama-3.1-8B-Instruct"
88
- hf_provider = HuggingFaceProvider(api_key=None) # Public models
 
 
 
 
 
 
89
  return HuggingFaceModel(model_name, provider=hf_provider)
90
 
91
 
 
38
  Args:
39
  oauth_token: Optional OAuth token from HuggingFace login (takes priority over env vars)
40
  """
41
+ # Priority: oauth_token > settings.hf_token > settings.huggingface_api_key
42
  effective_hf_token = oauth_token or settings.hf_token or settings.huggingface_api_key
43
 
44
+ # HuggingFaceProvider requires a token - cannot use None
45
+ if not effective_hf_token:
46
+ raise ConfigurationError(
47
+ "HuggingFace token required. Please either:\n"
48
+ "1. Log in via HuggingFace OAuth (recommended for Spaces)\n"
49
+ "2. Set HF_TOKEN environment variable\n"
50
+ "3. Set huggingface_api_key in settings"
 
51
  )
 
52
 
53
+ # Always use HuggingFace with available token
54
  model_name = settings.huggingface_model or "meta-llama/Llama-3.1-8B-Instruct"
55
+ hf_provider = HuggingFaceProvider(api_key=effective_hf_token)
56
+ logger.info(
57
+ "using_huggingface_with_token",
58
+ has_oauth=bool(oauth_token),
59
+ has_settings_token=bool(settings.hf_token or settings.huggingface_api_key),
60
+ model=model_name,
61
+ )
62
  return HuggingFaceModel(model_name, provider=hf_provider)
63
 
64
 
src/agents/retrieval_agent.py CHANGED
@@ -28,28 +28,28 @@ async def search_web(query: str, max_results: int = 10) -> str:
28
  logger.info("Web search starting", query=query, max_results=max_results)
29
  state = get_magentic_state()
30
 
31
- results = await _web_search.search(query, max_results)
32
- if not results.evidence:
33
  logger.info("Web search returned no results", query=query)
34
  return f"No web results found for: {query}"
35
 
36
  # Update state
37
  # We add *all* found results to state
38
- new_count = state.add_evidence(results.evidence)
39
  logger.info(
40
  "Web search complete",
41
  query=query,
42
- results_found=len(results.evidence),
43
  new_evidence=new_count,
44
  )
45
 
46
  # Use embedding service for deduplication/indexing if available
47
  if state.embedding_service:
48
  # This method also adds to vector DB as a side effect for unique items
49
- await state.embedding_service.deduplicate(results.evidence)
50
 
51
- output = [f"Found {len(results.evidence)} web results ({new_count} new stored):\n"]
52
- for i, r in enumerate(results.evidence[:max_results], 1):
53
  output.append(f"{i}. **{r.citation.title}**")
54
  output.append(f" Source: {r.citation.url}")
55
  output.append(f" {r.content[:300]}...\n")
 
28
  logger.info("Web search starting", query=query, max_results=max_results)
29
  state = get_magentic_state()
30
 
31
+ evidence = await _web_search.search(query, max_results)
32
+ if not evidence:
33
  logger.info("Web search returned no results", query=query)
34
  return f"No web results found for: {query}"
35
 
36
  # Update state
37
  # We add *all* found results to state
38
+ new_count = state.add_evidence(evidence)
39
  logger.info(
40
  "Web search complete",
41
  query=query,
42
+ results_found=len(evidence),
43
  new_evidence=new_count,
44
  )
45
 
46
  # Use embedding service for deduplication/indexing if available
47
  if state.embedding_service:
48
  # This method also adds to vector DB as a side effect for unique items
49
+ await state.embedding_service.deduplicate(evidence)
50
 
51
+ output = [f"Found {len(evidence)} web results ({new_count} new stored):\n"]
52
+ for i, r in enumerate(evidence[:max_results], 1):
53
  output.append(f"{i}. **{r.citation.title}**")
54
  output.append(f" Source: {r.citation.url}")
55
  output.append(f" {r.content[:300]}...\n")
src/app.py CHANGED
@@ -30,6 +30,7 @@ from src.agent_factory.judges import HFInferenceJudgeHandler, JudgeHandler, Mock
30
  from src.orchestrator_factory import create_orchestrator
31
  from src.services.audio_processing import get_audio_service
32
  from src.services.multimodal_processing import get_multimodal_service
 
33
  from src.tools.clinicaltrials import ClinicalTrialsTool
34
  from src.tools.europepmc import EuropePMCTool
35
  from src.tools.pubmed import PubMedTool
@@ -37,6 +38,8 @@ from src.tools.search_handler import SearchHandler
37
  from src.utils.config import settings
38
  from src.utils.models import AgentEvent, OrchestratorConfig
39
 
 
 
40
 
41
  def configure_orchestrator(
42
  use_mock: bool = False,
@@ -70,8 +73,18 @@ def configure_orchestrator(
70
 
71
  # Create search tools with RAG enabled
72
  # Pass OAuth token to SearchHandler so it can be used by RAG service
73
  search_handler = SearchHandler(
74
- tools=[PubMedTool(), ClinicalTrialsTool(), EuropePMCTool()],
75
  timeout=config.search_timeout,
76
  include_rag=True,
77
  auto_ingest_to_rag=True,
@@ -150,6 +163,49 @@ def configure_orchestrator(
150
  return orchestrator, backend_info
151
 
152
 
153
  def event_to_chat_message(event: AgentEvent) -> dict[str, Any]:
154
  """
155
  Convert AgentEvent to gr.ChatMessage with metadata for accordion display.
@@ -183,11 +239,61 @@ def event_to_chat_message(event: AgentEvent) -> dict[str, Any]:
183
 
184
  # For complete events, return main response without accordion
185
  if event.type == "complete":
186
  # Return as dict format for Gradio Chatbot compatibility
187
- return {
188
  "role": "assistant",
189
- "content": event.message,
190
  }
191
 
192
  # Build metadata for accordion according to Gradio ChatMessage spec
193
  # Metadata keys: title (str), status ("pending"|"done"), log (str), duration (float)
 
30
  from src.orchestrator_factory import create_orchestrator
31
  from src.services.audio_processing import get_audio_service
32
  from src.services.multimodal_processing import get_multimodal_service
33
+ import structlog
34
  from src.tools.clinicaltrials import ClinicalTrialsTool
35
  from src.tools.europepmc import EuropePMCTool
36
  from src.tools.pubmed import PubMedTool
 
38
  from src.utils.config import settings
39
  from src.utils.models import AgentEvent, OrchestratorConfig
40
 
41
+ logger = structlog.get_logger()
42
+
43
 
44
  def configure_orchestrator(
45
  use_mock: bool = False,
 
73
 
74
  # Create search tools with RAG enabled
75
  # Pass OAuth token to SearchHandler so it can be used by RAG service
76
+ tools = [PubMedTool(), ClinicalTrialsTool(), EuropePMCTool()]
77
+
78
+ # Add web search tool if available
79
+ from src.tools.web_search_factory import create_web_search_tool
80
+
81
+ web_search_tool = create_web_search_tool()
82
+ if web_search_tool is not None:
83
+ tools.append(web_search_tool)
84
+ logger.info("Web search tool added to search handler", provider=web_search_tool.name)
85
+
86
  search_handler = SearchHandler(
87
+ tools=tools,
88
  timeout=config.search_timeout,
89
  include_rag=True,
90
  auto_ingest_to_rag=True,
 
163
  return orchestrator, backend_info
164
 
165
 
166
+ def _is_file_path(text: str) -> bool:
167
+ """Check if text appears to be a file path.
168
+
169
+ Args:
170
+ text: Text to check
171
+
172
+ Returns:
173
+ True if text looks like a file path
174
+ """
175
+ import os
176
+ # Check for common file extensions
177
+ file_extensions = ['.md', '.pdf', '.txt', '.json', '.csv', '.xlsx', '.docx', '.html']
178
+ text_lower = text.lower().strip()
179
+
180
+ # Check if it ends with a file extension
181
+ if any(text_lower.endswith(ext) for ext in file_extensions):
182
+ # Check if it's a valid path (absolute or relative)
183
+ if os.path.sep in text or '/' in text or '\\' in text:
184
+ return True
185
+ # Or if it's just a filename with extension
186
+ if '.' in text and len(text.split('.')) == 2:
187
+ return True
188
+
189
+ # Check if it's an absolute path
190
+ if os.path.isabs(text):
191
+ return True
192
+
193
+ return False
194
+
195
+
196
+ def _get_file_name(file_path: str) -> str:
197
+ """Extract filename from file path.
198
+
199
+ Args:
200
+ file_path: Full file path
201
+
202
+ Returns:
203
+ Filename with extension
204
+ """
205
+ import os
206
+ return os.path.basename(file_path)
207
+
208
+
209
  def event_to_chat_message(event: AgentEvent) -> dict[str, Any]:
210
  """
211
  Convert AgentEvent to gr.ChatMessage with metadata for accordion display.
 
239
 
240
  # For complete events, return main response without accordion
241
  if event.type == "complete":
242
+ # Check if event contains file information
243
+ content = event.message
244
+ files: list[str] | None = None
245
+
246
+ # Check event.data for file paths
247
+ if event.data and isinstance(event.data, dict):
248
+ # Support both "files" (list) and "file" (single path) keys
249
+ if "files" in event.data:
250
+ files = event.data["files"]
251
+ if isinstance(files, str):
252
+ files = [files]
253
+ elif not isinstance(files, list):
254
+ files = None
255
+ else:
256
+ # Filter to only valid file paths
257
+ files = [f for f in files if isinstance(f, str) and _is_file_path(f)]
258
+ elif "file" in event.data:
259
+ file_path = event.data["file"]
260
+ if isinstance(file_path, str) and _is_file_path(file_path):
261
+ files = [file_path]
262
+
263
+ # Also check if message itself is a file path (less common, but possible)
264
+ if not files and isinstance(event.message, str) and _is_file_path(event.message):
265
+ files = [event.message]
266
+ # Keep message as text description
267
+ content = "Report generated. Download available below."
268
+
269
  # Return as dict format for Gradio Chatbot compatibility
270
+ result: dict[str, Any] = {
271
  "role": "assistant",
272
+ "content": content,
273
  }
274
+
275
+ # Add files if present
276
+ # Gradio Chatbot supports file paths in content as markdown links
277
+ # The links will be clickable and downloadable
278
+ if files:
279
+ # Validate files exist before including them
280
+ import os
281
+ valid_files = [f for f in files if os.path.exists(f)]
282
+
283
+ if valid_files:
284
+ # Format files for Gradio: include as markdown download links
285
+ file_links = "\n\n".join([
286
+ f"📎 [Download: {_get_file_name(f)}]({f})"
287
+ for f in valid_files
288
+ ])
289
+ result["content"] = f"{content}\n\n{file_links}"
290
+
291
+ # Also store in metadata for potential future use
292
+ if "metadata" not in result:
293
+ result["metadata"] = {}
294
+ result["metadata"]["files"] = valid_files
295
+
296
+ return result
297
 
298
  # Build metadata for accordion according to Gradio ChatMessage spec
299
  # Metadata keys: title (str), status ("pending"|"done"), log (str), duration (float)
src/orchestrator/graph_orchestrator.py CHANGED
@@ -533,10 +533,33 @@ class GraphOrchestrator:
533
 
534
  # Final event
535
  final_result = context.get_node_result(current_node_id) if current_node_id else None
 
536
  yield AgentEvent(
537
  type="complete",
538
- message=final_result if isinstance(final_result, str) else "Research completed",
539
- data={"mode": self.mode, "iterations": iteration},
540
  iteration=iteration,
541
  )
542
 
 
533
 
534
  # Final event
535
  final_result = context.get_node_result(current_node_id) if current_node_id else None
536
+
537
+ # Check if final result contains file information
538
+ event_data: dict[str, Any] = {"mode": self.mode, "iterations": iteration}
539
+ message: str = "Research completed"
540
+
541
+ if isinstance(final_result, str):
542
+ message = final_result
543
+ elif isinstance(final_result, dict):
544
+ # If result is a dict, check for file paths
545
+ if "file" in final_result:
546
+ file_path = final_result["file"]
547
+ if isinstance(file_path, str):
548
+ event_data["file"] = file_path
549
+ message = final_result.get("message", "Report generated. Download available.")
550
+ elif "files" in final_result:
551
+ files = final_result["files"]
552
+ if isinstance(files, list):
553
+ event_data["files"] = files
554
+ message = final_result.get("message", "Report generated. Downloads available.")
555
+ elif isinstance(files, str):
556
+ event_data["files"] = [files]
557
+ message = final_result.get("message", "Report generated. Download available.")
558
+
559
  yield AgentEvent(
560
  type="complete",
561
+ message=message,
562
+ data=event_data,
563
  iteration=iteration,
564
  )
565
 
src/tools/rate_limiter.py CHANGED
@@ -93,6 +93,33 @@ def reset_pubmed_limiter() -> None:
93
  _pubmed_limiter = None
94
 
95
 
96
  # Factory for other APIs
97
  class RateLimiterFactory:
98
  """Factory for creating/getting rate limiters for different APIs."""
 
93
  _pubmed_limiter = None
94
 
95
 
96
+ def get_serper_limiter(api_key: str | None = None) -> RateLimiter:
97
+ """
98
+ Get the shared Serper API rate limiter.
99
+
100
+ Rate: 10 requests/second (Serper API limit)
101
+
102
+ Args:
103
+ api_key: Serper API key (optional, for consistency with other limiters)
104
+
105
+ Returns:
106
+ Shared RateLimiter instance
107
+ """
108
+ return RateLimiterFactory.get("serper", "10/second")
109
+
110
+
111
+ def get_searchxng_limiter() -> RateLimiter:
112
+ """
113
+ Get the shared SearchXNG API rate limiter.
114
+
115
+ Rate: 5 requests/second (conservative limit)
116
+
117
+ Returns:
118
+ Shared RateLimiter instance
119
+ """
120
+ return RateLimiterFactory.get("searchxng", "5/second")
121
+
122
+
123
  # Factory for other APIs
124
  class RateLimiterFactory:
125
  """Factory for creating/getting rate limiters for different APIs."""
src/tools/searchxng_web_search.py ADDED
@@ -0,0 +1,119 @@
1
+ """SearchXNG web search tool using SearchXNG API for Google searches."""
2
+
3
+ from typing import Any
4
+
5
+ import structlog
6
+ from tenacity import retry, stop_after_attempt, wait_exponential
7
+
8
+ from src.tools.base import SearchTool
9
+ from src.tools.query_utils import preprocess_query
10
+ from src.tools.rate_limiter import get_searchxng_limiter
11
+ from src.tools.vendored.searchxng_client import SearchXNGClient
12
+ from src.tools.vendored.web_search_core import scrape_urls
13
+ from src.utils.config import settings
14
+ from src.utils.exceptions import ConfigurationError, RateLimitError, SearchError
15
+ from src.utils.models import Citation, Evidence
16
+
17
+ logger = structlog.get_logger()
18
+
19
+
20
+ class SearchXNGWebSearchTool:
21
+ """Tool for searching the web using SearchXNG API (Google search)."""
22
+
23
+ def __init__(self, host: str | None = None) -> None:
24
+ """Initialize SearchXNG web search tool.
25
+
26
+ Args:
27
+ host: SearchXNG host URL. If None, reads from settings.
28
+
29
+ Raises:
30
+ ConfigurationError: If no host is available.
31
+ """
32
+ self.host = host or settings.searchxng_host
33
+ if not self.host:
34
+ raise ConfigurationError(
35
+ "SearchXNG host required. Set SEARCHXNG_HOST environment variable or searchxng_host in settings."
36
+ )
37
+
38
+ self._client = SearchXNGClient(host=self.host)
39
+ self._limiter = get_searchxng_limiter()
40
+
41
+ @property
42
+ def name(self) -> str:
43
+ """Return the name of this search tool."""
44
+ return "searchxng"
45
+
46
+ async def _rate_limit(self) -> None:
47
+ """Enforce SearchXNG API rate limiting."""
48
+ await self._limiter.acquire()
49
+
50
+ @retry(
51
+ stop=stop_after_attempt(3),
52
+ wait=wait_exponential(multiplier=1, min=1, max=10),
53
+ reraise=True,
54
+ )
55
+ async def search(self, query: str, max_results: int = 10) -> list[Evidence]:
56
+ """Execute a web search using SearchXNG API.
57
+
58
+ Args:
59
+ query: The search query string
60
+ max_results: Maximum number of results to return
61
+
62
+ Returns:
63
+ List of Evidence objects
64
+
65
+ Raises:
66
+ SearchError: If the search fails
67
+ RateLimitError: If rate limit is exceeded
68
+ """
69
+ await self._rate_limit()
70
+
71
+ # Preprocess query to remove noise
72
+ clean_query = preprocess_query(query)
73
+ final_query = clean_query if clean_query else query
74
+
75
+ try:
76
+ # Get search results (snippets)
77
+ search_results = await self._client.search(
78
+ final_query, filter_for_relevance=False, max_results=max_results
79
+ )
80
+
81
+ if not search_results:
82
+ logger.info("No search results found", query=final_query)
83
+ return []
84
+
85
+ # Scrape URLs to get full content
86
+ scraped = await scrape_urls(search_results)
87
+
88
+ # Convert ScrapeResult to Evidence objects
89
+ evidence = []
90
+ for result in scraped:
91
+ ev = Evidence(
92
+ content=result.text,
93
+ citation=Citation(
94
+ title=result.title,
95
+ url=result.url,
96
+ source="searchxng",
97
+ date="Unknown",
98
+ authors=[],
99
+ ),
100
+ relevance=0.0,
101
+ )
102
+ evidence.append(ev)
103
+
104
+ logger.info(
105
+ "SearchXNG search complete",
106
+ query=final_query,
107
+ results_found=len(evidence),
108
+ )
109
+
110
+ return evidence
111
+
112
+ except RateLimitError:
113
+ raise
114
+ except SearchError:
115
+ raise
116
+ except Exception as e:
117
+ logger.error("Unexpected error in SearchXNG search", error=str(e), query=final_query)
118
+ raise SearchError(f"SearchXNG search failed: {e}") from e
119
+
src/tools/serper_web_search.py ADDED
@@ -0,0 +1,119 @@
1
+ """Serper web search tool using Serper API for Google searches."""
2
+
3
+ from typing import Any
4
+
5
+ import structlog
6
+ from tenacity import retry, stop_after_attempt, wait_exponential
7
+
8
+ from src.tools.base import SearchTool
9
+ from src.tools.query_utils import preprocess_query
10
+ from src.tools.rate_limiter import get_serper_limiter
11
+ from src.tools.vendored.serper_client import SerperClient
12
+ from src.tools.vendored.web_search_core import scrape_urls
13
+ from src.utils.config import settings
14
+ from src.utils.exceptions import ConfigurationError, RateLimitError, SearchError
15
+ from src.utils.models import Citation, Evidence
16
+
17
+ logger = structlog.get_logger()
18
+
19
+
20
+ class SerperWebSearchTool:
21
+ """Tool for searching the web using Serper API (Google search)."""
22
+
23
+ def __init__(self, api_key: str | None = None) -> None:
24
+ """Initialize Serper web search tool.
25
+
26
+ Args:
27
+ api_key: Serper API key. If None, reads from settings.
28
+
29
+ Raises:
30
+ ConfigurationError: If no API key is available.
31
+ """
32
+ self.api_key = api_key or settings.serper_api_key
33
+ if not self.api_key:
34
+ raise ConfigurationError(
35
+ "Serper API key required. Set SERPER_API_KEY environment variable or serper_api_key in settings."
36
+ )
37
+
38
+ self._client = SerperClient(api_key=self.api_key)
39
+ self._limiter = get_serper_limiter(self.api_key)
40
+
41
+ @property
42
+ def name(self) -> str:
43
+ """Return the name of this search tool."""
44
+ return "serper"
45
+
46
+ async def _rate_limit(self) -> None:
47
+ """Enforce Serper API rate limiting."""
48
+ await self._limiter.acquire()
49
+
50
+ @retry(
51
+ stop=stop_after_attempt(3),
52
+ wait=wait_exponential(multiplier=1, min=1, max=10),
53
+ reraise=True,
54
+ )
55
+ async def search(self, query: str, max_results: int = 10) -> list[Evidence]:
56
+ """Execute a web search using Serper API.
57
+
58
+ Args:
59
+ query: The search query string
60
+ max_results: Maximum number of results to return
61
+
62
+ Returns:
63
+ List of Evidence objects
64
+
65
+ Raises:
66
+ SearchError: If the search fails
67
+ RateLimitError: If rate limit is exceeded
68
+ """
69
+ await self._rate_limit()
70
+
71
+ # Preprocess query to remove noise
72
+ clean_query = preprocess_query(query)
73
+ final_query = clean_query if clean_query else query
74
+
75
+ try:
76
+ # Get search results (snippets)
77
+ search_results = await self._client.search(
78
+ final_query, filter_for_relevance=False, max_results=max_results
79
+ )
80
+
81
+ if not search_results:
82
+ logger.info("No search results found", query=final_query)
83
+ return []
84
+
85
+ # Scrape URLs to get full content
86
+ scraped = await scrape_urls(search_results)
87
+
88
+ # Convert ScrapeResult to Evidence objects
89
+ evidence = []
90
+ for result in scraped:
91
+ ev = Evidence(
92
+ content=result.text,
93
+ citation=Citation(
94
+ title=result.title,
95
+ url=result.url,
96
+ source="serper",
97
+ date="Unknown",
98
+ authors=[],
99
+ ),
100
+ relevance=0.0,
101
+ )
102
+ evidence.append(ev)
103
+
104
+ logger.info(
105
+ "Serper search complete",
106
+ query=final_query,
107
+ results_found=len(evidence),
108
+ )
109
+
110
+ return evidence
111
+
112
+ except RateLimitError:
113
+ raise
114
+ except SearchError:
115
+ raise
116
+ except Exception as e:
117
+ logger.error("Unexpected error in Serper search", error=str(e), query=final_query)
118
+ raise SearchError(f"Serper search failed: {e}") from e
119
+
src/tools/vendored/__init__.py ADDED
@@ -0,0 +1,26 @@
1
+ """Vendored web search components from folder/tools/web_search.py."""
2
+
3
+ from src.tools.vendored.web_search_core import (
4
+ CONTENT_LENGTH_LIMIT,
5
+ ScrapeResult,
6
+ WebpageSnippet,
7
+ scrape_urls,
8
+ fetch_and_process_url,
9
+ html_to_text,
10
+ is_valid_url,
11
+ )
12
+ from src.tools.vendored.serper_client import SerperClient
13
+ from src.tools.vendored.searchxng_client import SearchXNGClient
14
+
15
+ __all__ = [
16
+ "CONTENT_LENGTH_LIMIT",
17
+ "ScrapeResult",
18
+ "WebpageSnippet",
19
+ "SerperClient",
20
+ "SearchXNGClient",
21
+ "scrape_urls",
22
+ "fetch_and_process_url",
23
+ "html_to_text",
24
+ "is_valid_url",
25
+ ]
26
+
src/tools/vendored/searchxng_client.py ADDED
@@ -0,0 +1,98 @@
1
+ """SearchXNG API client for Google searches.
2
+
3
+ Vendored and adapted from folder/tools/web_search.py.
4
+ """
5
+
6
+ import os
7
+ from typing import List, Optional
8
+
9
+ import aiohttp
10
+ import structlog
11
+
12
+ from src.tools.vendored.web_search_core import WebpageSnippet, ssl_context
13
+ from src.utils.exceptions import RateLimitError, SearchError
14
+
15
+ logger = structlog.get_logger()
16
+
17
+
18
+ class SearchXNGClient:
19
+ """A client for the SearchXNG API to perform Google searches."""
20
+
21
+ def __init__(self, host: Optional[str] = None) -> None:
22
+ """Initialize SearchXNG client.
23
+
24
+ Args:
25
+ host: SearchXNG host URL. If None, reads from SEARCHXNG_HOST env var.
26
+
27
+ Raises:
28
+ ConfigurationError: If no host is provided.
29
+ """
30
+ host = host or os.getenv("SEARCHXNG_HOST")
31
+ if not host:
32
+ from src.utils.exceptions import ConfigurationError
33
+
34
+ raise ConfigurationError("SEARCHXNG_HOST environment variable is not set")
35
+
36
+ # Ensure host ends with /search
37
+ if not host.endswith("/search"):
38
+ host = f"{host}/search" if not host.endswith("/") else f"{host}search"
39
+
40
+ self.host: str = host
41
+
42
+ async def search(
43
+ self, query: str, filter_for_relevance: bool = False, max_results: int = 5
44
+ ) -> List[WebpageSnippet]:
45
+ """Perform a search using SearchXNG API.
46
+
47
+ Args:
48
+ query: The search query
49
+ filter_for_relevance: Whether to filter results (currently not implemented)
50
+ max_results: Maximum number of results to return
51
+
52
+ Returns:
53
+ List of WebpageSnippet objects with search results
54
+
55
+ Raises:
56
+ SearchError: If the search fails
57
+ RateLimitError: If rate limit is exceeded
58
+ """
59
+ connector = aiohttp.TCPConnector(ssl=ssl_context)
60
+ try:
61
+ async with aiohttp.ClientSession(connector=connector) as session:
62
+ params = {
63
+ "q": query,
64
+ "format": "json",
65
+ }
66
+
67
+ async with session.get(self.host, params=params) as response:
68
+ if response.status == 429:
69
+ raise RateLimitError("SearchXNG API rate limit exceeded")
70
+
71
+ response.raise_for_status()
72
+ results = await response.json()
73
+
74
+ results_list = [
75
+ WebpageSnippet(
76
+ url=result.get("url", ""),
77
+ title=result.get("title", ""),
78
+ description=result.get("content", ""),
79
+ )
80
+ for result in results.get("results", [])
81
+ ]
82
+
83
+ if not results_list:
84
+ logger.info("No search results found", query=query)
85
+ return []
86
+
87
+ # Return results up to max_results
88
+ return results_list[:max_results]
89
+
90
+ except aiohttp.ClientError as e:
91
+ logger.error("SearchXNG API request failed", error=str(e), query=query)
92
+ raise SearchError(f"SearchXNG API request failed: {e}") from e
93
+ except RateLimitError:
94
+ raise
95
+ except Exception as e:
96
+ logger.error("Unexpected error in SearchXNG search", error=str(e), query=query)
97
+ raise SearchError(f"SearchXNG search failed: {e}") from e
98
+
src/tools/vendored/serper_client.py ADDED
@@ -0,0 +1,94 @@
1
+ """Serper API client for Google searches.
2
+
3
+ Vendored and adapted from folder/tools/web_search.py.
4
+ """
5
+
6
+ import os
7
+ from typing import List, Optional
8
+
9
+ import aiohttp
10
+ import structlog
11
+
12
+ from src.tools.vendored.web_search_core import WebpageSnippet, ssl_context
13
+ from src.utils.exceptions import RateLimitError, SearchError
14
+
15
+ logger = structlog.get_logger()
16
+
17
+
18
+ class SerperClient:
19
+ """A client for the Serper API to perform Google searches."""
20
+
21
+ def __init__(self, api_key: Optional[str] = None) -> None:
22
+ """Initialize Serper client.
23
+
24
+ Args:
25
+ api_key: Serper API key. If None, reads from SERPER_API_KEY env var.
26
+
27
+ Raises:
28
+ ConfigurationError: If no API key is provided.
29
+ """
30
+ self.api_key = api_key or os.getenv("SERPER_API_KEY")
31
+ if not self.api_key:
32
+ from src.utils.exceptions import ConfigurationError
33
+
34
+ raise ConfigurationError(
35
+ "No API key provided. Set SERPER_API_KEY environment variable."
36
+ )
37
+
38
+ self.url = "https://google.serper.dev/search"
39
+ self.headers = {"X-API-KEY": self.api_key, "Content-Type": "application/json"}
40
+
41
+ async def search(
42
+ self, query: str, filter_for_relevance: bool = False, max_results: int = 5
43
+ ) -> List[WebpageSnippet]:
44
+ """Perform a Google search using Serper API.
45
+
46
+ Args:
47
+ query: The search query
48
+ filter_for_relevance: Whether to filter results (currently not implemented)
49
+ max_results: Maximum number of results to return
50
+
51
+ Returns:
52
+ List of WebpageSnippet objects with search results
53
+
54
+ Raises:
55
+ SearchError: If the search fails
56
+ RateLimitError: If rate limit is exceeded
57
+ """
58
+ connector = aiohttp.TCPConnector(ssl=ssl_context)
59
+ try:
60
+ async with aiohttp.ClientSession(connector=connector) as session:
61
+ async with session.post(
62
+ self.url, headers=self.headers, json={"q": query, "autocorrect": False}
63
+ ) as response:
64
+ if response.status == 429:
65
+ raise RateLimitError("Serper API rate limit exceeded")
66
+
67
+ response.raise_for_status()
68
+ results = await response.json()
69
+
70
+ results_list = [
71
+ WebpageSnippet(
72
+ url=result.get("link", ""),
73
+ title=result.get("title", ""),
74
+ description=result.get("snippet", ""),
75
+ )
76
+ for result in results.get("organic", [])
77
+ ]
78
+
79
+ if not results_list:
80
+ logger.info("No search results found", query=query)
81
+ return []
82
+
83
+ # Return results up to max_results
84
+ return results_list[:max_results]
85
+
86
+ except aiohttp.ClientError as e:
87
+ logger.error("Serper API request failed", error=str(e), query=query)
88
+ raise SearchError(f"Serper API request failed: {e}") from e
89
+ except RateLimitError:
90
+ raise
91
+ except Exception as e:
92
+ logger.error("Unexpected error in Serper search", error=str(e), query=query)
93
+ raise SearchError(f"Serper search failed: {e}") from e
94
+
src/tools/vendored/web_search_core.py ADDED
@@ -0,0 +1,205 @@
1
+ """Core web search utilities vendored from folder/tools/web_search.py.
2
+
3
+ This module contains shared utilities for web scraping, URL processing,
4
+ and HTML text extraction used by web search tools.
5
+ """
6
+
7
+ import asyncio
8
+ import ssl
9
+ from typing import List, Optional
10
+
11
+ import aiohttp
12
+ import structlog
13
+ from bs4 import BeautifulSoup
14
+ from pydantic import BaseModel, Field
15
+
16
+ logger = structlog.get_logger()
17
+
18
+ # Content length limit to avoid exceeding token limits
19
+ CONTENT_LENGTH_LIMIT = 10000
20
+
21
+ # Create a shared SSL context for web requests
22
+ ssl_context = ssl.create_default_context()
23
+ ssl_context.check_hostname = False
24
+ ssl_context.verify_mode = ssl.CERT_NONE
25
+ ssl_context.set_ciphers("DEFAULT:@SECLEVEL=1") # Allow older cipher suites
26
+
27
+
28
+ class ScrapeResult(BaseModel):
29
+ """Result of scraping a single webpage."""
30
+
31
+ url: str = Field(description="The URL of the webpage")
32
+ text: str = Field(description="The full text content of the webpage")
33
+ title: str = Field(description="The title of the webpage")
34
+ description: str = Field(description="A short description of the webpage")
35
+
36
+
37
+ class WebpageSnippet(BaseModel):
38
+ """Snippet information for a webpage (before scraping)."""
39
+
40
+ url: str = Field(description="The URL of the webpage")
41
+ title: str = Field(description="The title of the webpage")
42
+ description: Optional[str] = Field(
43
+ default=None, description="A short description of the webpage"
44
+ )
45
+
46
+
47
+ async def scrape_urls(items: List[WebpageSnippet]) -> List[ScrapeResult]:
48
+ """Fetch text content from provided URLs.
49
+
50
+ Args:
51
+ items: List of WebpageSnippet items to extract content from
52
+
53
+ Returns:
54
+ List of ScrapeResult objects with scraped content
55
+ """
56
+ connector = aiohttp.TCPConnector(ssl=ssl_context)
57
+ async with aiohttp.ClientSession(connector=connector) as session:
58
+ # Create list of tasks for concurrent execution
59
+ tasks = []
60
+ for item in items:
61
+ if item.url: # Skip empty URLs
62
+ tasks.append(fetch_and_process_url(session, item))
63
+
64
+ # Execute all tasks concurrently and gather results
65
+ results = await asyncio.gather(*tasks, return_exceptions=True)
66
+
67
+ # Filter out errors and return successful results
68
+ successful_results: List[ScrapeResult] = []
69
+ for result in results:
70
+ if isinstance(result, ScrapeResult):
71
+ successful_results.append(result)
72
+ elif isinstance(result, Exception):
73
+ logger.warning("Failed to scrape URL", error=str(result))
74
+
75
+ return successful_results
76
+
77
+
78
+ async def fetch_and_process_url(
79
+ session: aiohttp.ClientSession, item: WebpageSnippet
80
+ ) -> ScrapeResult:
81
+ """Helper function to fetch and process a single URL.
82
+
83
+ Args:
84
+ session: aiohttp ClientSession
85
+ item: WebpageSnippet with URL to fetch
86
+
87
+ Returns:
88
+ ScrapeResult with fetched content
89
+ """
90
+ if not is_valid_url(item.url):
91
+ return ScrapeResult(
92
+ url=item.url,
93
+ title=item.title,
94
+ description=item.description or "",
95
+ text="Error fetching content: URL contains restricted file extension",
96
+ )
97
+
98
+ try:
99
+ timeout = aiohttp.ClientTimeout(total=8)
100
+ async with session.get(item.url, timeout=timeout) as response:
101
+ if response.status == 200:
102
+ content = await response.text()
103
+ # Run html_to_text in a thread pool to avoid blocking
104
+ loop = asyncio.get_event_loop()
105
+ text_content = await loop.run_in_executor(None, html_to_text, content)
106
+ text_content = text_content[
107
+ :CONTENT_LENGTH_LIMIT
108
+ ] # Trim content to avoid exceeding token limit
109
+ return ScrapeResult(
110
+ url=item.url,
111
+ title=item.title,
112
+ description=item.description or "",
113
+ text=text_content,
114
+ )
115
+ else:
116
+ # Return a ScrapeResult with an error message
117
+ return ScrapeResult(
118
+ url=item.url,
119
+ title=item.title,
120
+ description=item.description or "",
121
+ text=f"Error fetching content: HTTP {response.status}",
122
+ )
123
+ except Exception as e:
124
+ logger.warning("Error fetching URL", url=item.url, error=str(e))
125
+ # Return a ScrapeResult with an error message
126
+ return ScrapeResult(
127
+ url=item.url,
128
+ title=item.title,
129
+ description=item.description or "",
130
+ text=f"Error fetching content: {str(e)}",
131
+ )
132
+
133
+
134
+ def html_to_text(html_content: str) -> str:
135
+ """Strip out unnecessary elements from HTML to prepare for text extraction.
136
+
137
+ Args:
138
+ html_content: Raw HTML content
139
+
140
+ Returns:
141
+ Extracted text from relevant HTML tags
142
+ """
143
+ # Parse the HTML using lxml for speed
144
+ soup = BeautifulSoup(html_content, "lxml")
145
+
146
+ # Extract text from relevant tags
147
+ tags_to_extract = ("h1", "h2", "h3", "h4", "h5", "h6", "p", "li", "blockquote")
148
+
149
+ # Use a generator expression for efficiency
150
+ extracted_text = "\n".join(
151
+ element.get_text(strip=True)
152
+ for element in soup.find_all(tags_to_extract)
153
+ if element.get_text(strip=True)
154
+ )
155
+
156
+ return extracted_text
157
+
158
+
159
+ def is_valid_url(url: str) -> bool:
160
+ """Check that a URL does not contain restricted file extensions.
161
+
162
+ Args:
163
+ url: URL to validate
164
+
165
+ Returns:
166
+ True if URL is valid, False if it contains restricted extensions
167
+ """
168
+ restricted_extensions = [
169
+ ".pdf",
170
+ ".doc",
171
+ ".xls",
172
+ ".ppt",
173
+ ".zip",
174
+ ".rar",
175
+ ".7z",
176
+ ".txt",
177
+ ".js",
178
+ ".xml",
179
+ ".css",
180
+ ".png",
181
+ ".jpg",
182
+ ".jpeg",
183
+ ".gif",
184
+ ".ico",
185
+ ".svg",
186
+ ".webp",
187
+ ".mp3",
188
+ ".mp4",
189
+ ".avi",
190
+ ".mov",
191
+ ".wmv",
192
+ ".flv",
193
+ ".wma",
194
+ ".wav",
195
+ ".m4a",
196
+ ".m4v",
197
+ ".m4b",
198
+ ".m4p",
199
+ ".m4u",
200
+ ]
201
+
202
+ if any(ext in url for ext in restricted_extensions):
203
+ return False
204
+ return True
205
+
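A short sketch of how the vendored helpers compose, assuming the module path `src/tools/vendored/web_search_core.py` introduced in this commit. Fetch failures do not raise; they come back as `ScrapeResult` objects whose `text` field carries the error message.

```python
# Sketch under stated assumptions: combine WebpageSnippet with scrape_urls
# to fetch pages concurrently and reduce them to plain text.
import asyncio

from src.tools.vendored.web_search_core import WebpageSnippet, scrape_urls

async def demo() -> None:
    snippets = [WebpageSnippet(url="https://example.com", title="Example Domain")]
    results = await scrape_urls(snippets)  # fetch errors are embedded in result.text
    for result in results:
        print(result.url, result.text[:80])

asyncio.run(demo())
```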
src/tools/web_search.py CHANGED
@@ -5,7 +5,9 @@ import asyncio
5
  import structlog
6
  from duckduckgo_search import DDGS
7
 
8
- from src.utils.models import Citation, Evidence, SearchResult
 
 
9
 
10
  logger = structlog.get_logger()
11
 
@@ -16,14 +18,34 @@ class WebSearchTool:
16
  def __init__(self) -> None:
17
  self._ddgs = DDGS()
18
 
19
- async def search(self, query: str, max_results: int = 10) -> SearchResult:
20
- """Execute a web search."""
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
21
  try:
 
 
 
 
22
  loop = asyncio.get_running_loop()
23
 
24
  def _do_search() -> list[dict[str, str]]:
25
  # text() returns an iterator, need to list() it or iterate
26
- return list(self._ddgs.text(query, max_results=max_results))
27
 
28
  raw_results = await loop.run_in_executor(None, _do_search)
29
 
@@ -42,12 +64,8 @@ class WebSearchTool:
42
  )
43
  evidence.append(ev)
44
 
45
- return SearchResult(
46
- query=query, evidence=evidence, sources_searched=["web"], total_found=len(evidence)
47
- )
48
 
49
  except Exception as e:
50
- logger.error("Web search failed", error=str(e))
51
- return SearchResult(
52
- query=query, evidence=[], sources_searched=["web"], total_found=0, errors=[str(e)]
53
- )
 
5
  import structlog
6
  from duckduckgo_search import DDGS
7
 
8
+ from src.tools.query_utils import preprocess_query
9
+ from src.utils.exceptions import SearchError
10
+ from src.utils.models import Citation, Evidence
11
 
12
  logger = structlog.get_logger()
13
 
 
18
  def __init__(self) -> None:
19
  self._ddgs = DDGS()
20
 
21
+ @property
22
+ def name(self) -> str:
23
+ """Return the name of this search tool."""
24
+ return "duckduckgo"
25
+
26
+ async def search(self, query: str, max_results: int = 10) -> list[Evidence]:
27
+ """Execute a web search and return evidence.
28
+
29
+ Args:
30
+ query: The search query string
31
+ max_results: Maximum number of results to return
32
+
33
+ Returns:
34
+ List of Evidence objects
35
+
36
+ Raises:
37
+ SearchError: If the search fails
38
+ """
39
  try:
40
+ # Preprocess query to remove noise
41
+ clean_query = preprocess_query(query)
42
+ final_query = clean_query if clean_query else query
43
+
44
  loop = asyncio.get_running_loop()
45
 
46
  def _do_search() -> list[dict[str, str]]:
47
  # text() returns an iterator, need to list() it or iterate
48
+ return list(self._ddgs.text(final_query, max_results=max_results))
49
 
50
  raw_results = await loop.run_in_executor(None, _do_search)
51
 
 
64
  )
65
  evidence.append(ev)
66
 
67
+ return evidence
 
 
68
 
69
  except Exception as e:
70
+ logger.error("Web search failed", error=str(e), query=query)
71
+ raise SearchError(f"DuckDuckGo search failed: {e}") from e
 
 
src/tools/web_search_adapter.py CHANGED
@@ -1,10 +1,12 @@
1
  """Web search tool adapter for Pydantic AI agents.
2
 
3
- Adapts the folder/tools/web_search.py implementation to work with Pydantic AI.
4
  """
5
 
6
  import structlog
7
 
 
 
8
  logger = structlog.get_logger()
9
 
10
 
@@ -22,42 +24,32 @@ async def web_search(query: str) -> str:
22
  Formatted string with search results including titles, descriptions, and URLs
23
  """
24
  try:
25
- # Lazy import to avoid requiring folder/ dependencies at import time
26
- # This will use the existing web_search tool from folder/tools
27
- from folder.llm_config import create_default_config
28
- from folder.tools.web_search import create_web_search_tool
29
-
30
- config = create_default_config()
31
- web_search_tool = create_web_search_tool(config)
32
 
33
- # Call the tool function
34
- # The tool returns List[ScrapeResult] or str
35
- results = await web_search_tool(query)
36
 
37
- if isinstance(results, str):
38
- # Error message returned
39
- logger.warning("Web search returned error", error=results)
40
- return results
41
 
42
- if not results:
43
  return f"No web search results found for: {query}"
44
 
45
  # Format results for agent consumption
46
- formatted = [f"Found {len(results)} web search results:\n"]
47
- for i, result in enumerate(results[:5], 1): # Limit to 5 results
48
- formatted.append(f"{i}. **{result.title}**")
49
- if result.description:
50
- formatted.append(f" {result.description[:200]}...")
51
- formatted.append(f" URL: {result.url}")
52
- if result.text:
53
- formatted.append(f" Content: {result.text[:300]}...")
54
  formatted.append("")
55
 
56
  return "\n".join(formatted)
57
 
58
- except ImportError as e:
59
- logger.error("Web search tool not available", error=str(e))
60
- return f"Web search tool not available: {e!s}"
61
  except Exception as e:
62
  logger.error("Web search failed", error=str(e), query=query)
63
  return f"Error performing web search: {e!s}"
 
1
  """Web search tool adapter for Pydantic AI agents.
2
 
3
+ Uses the new web search factory to provide web search functionality.
4
  """
5
 
6
  import structlog
7
 
8
+ from src.tools.web_search_factory import create_web_search_tool
9
+
10
  logger = structlog.get_logger()
11
 
12
 
 
24
  Formatted string with search results including titles, descriptions, and URLs
25
  """
26
  try:
27
+ # Get web search tool from factory
28
+ tool = create_web_search_tool()
 
 
 
 
 
29
 
30
+ if tool is None:
31
+ logger.warning("Web search tool not available", hint="Check configuration")
32
+ return "Web search tool not available. Please configure a web search provider."
33
 
34
+ # Call the tool - it returns list[Evidence]
35
+ evidence = await tool.search(query, max_results=5)
 
 
36
 
37
+ if not evidence:
38
  return f"No web search results found for: {query}"
39
 
40
  # Format results for agent consumption
41
+ formatted = [f"Found {len(evidence)} web search results:\n"]
42
+ for i, ev in enumerate(evidence, 1):
43
+ citation = ev.citation
44
+ formatted.append(f"{i}. **{citation.title}**")
45
+ if citation.url:
46
+ formatted.append(f" URL: {citation.url}")
47
+ if ev.content:
48
+ formatted.append(f" Content: {ev.content[:300]}...")
49
  formatted.append("")
50
 
51
  return "\n".join(formatted)
52
 
 
 
 
53
  except Exception as e:
54
  logger.error("Web search failed", error=str(e), query=query)
55
  return f"Error performing web search: {e!s}"
src/tools/web_search_factory.py ADDED
@@ -0,0 +1,73 @@
1
+ """Factory for creating web search tools based on configuration."""
2
+
3
+ import structlog
4
+
5
+ from src.tools.base import SearchTool
6
+ from src.tools.searchxng_web_search import SearchXNGWebSearchTool
7
+ from src.tools.serper_web_search import SerperWebSearchTool
8
+ from src.tools.web_search import WebSearchTool
9
+ from src.utils.config import settings
10
+ from src.utils.exceptions import ConfigurationError
11
+
12
+ logger = structlog.get_logger()
13
+
14
+
15
+ def create_web_search_tool() -> SearchTool | None:
16
+ """Create a web search tool based on configuration.
17
+
18
+ Returns:
19
+ SearchTool instance, or None if not available/configured
20
+
21
+ The tool is selected based on settings.web_search_provider:
22
+ - "serper": SerperWebSearchTool (requires SERPER_API_KEY)
23
+ - "searchxng": SearchXNGWebSearchTool (requires SEARCHXNG_HOST)
24
+ - "duckduckgo": WebSearchTool (always available, no API key)
25
+ - "brave" or "tavily": Not yet implemented, returns None
26
+ """
27
+ provider = settings.web_search_provider
28
+
29
+ try:
30
+ if provider == "serper":
31
+ if not settings.serper_api_key:
32
+ logger.warning(
33
+ "Serper provider selected but no API key found",
34
+ hint="Set SERPER_API_KEY environment variable",
35
+ )
36
+ return None
37
+ return SerperWebSearchTool()
38
+
39
+ elif provider == "searchxng":
40
+ if not settings.searchxng_host:
41
+ logger.warning(
42
+ "SearchXNG provider selected but no host found",
43
+ hint="Set SEARCHXNG_HOST environment variable",
44
+ )
45
+ return None
46
+ return SearchXNGWebSearchTool()
47
+
48
+ elif provider == "duckduckgo":
49
+ # DuckDuckGo is always available (no API key required)
50
+ return WebSearchTool()
51
+
52
+ elif provider in ("brave", "tavily"):
53
+ logger.warning(
54
+ f"Web search provider '{provider}' not yet implemented",
55
+ hint="Use 'serper', 'searchxng', or 'duckduckgo'",
56
+ )
57
+ return None
58
+
59
+ else:
60
+ logger.warning(
61
+ f"Unknown web search provider '{provider}', falling back to DuckDuckGo"
62
+ )
63
+ return WebSearchTool()
64
+
65
+ except ConfigurationError as e:
66
+ logger.error("Failed to create web search tool", error=str(e), provider=provider)
67
+ return None
68
+ except Exception as e:
69
+ logger.error(
70
+ "Unexpected error creating web search tool", error=str(e), provider=provider
71
+ )
72
+ return None
73
+
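A sketch of consuming the factory. The factory may return `None` when the selected provider is unconfigured (for example, `serper` without `SERPER_API_KEY`), so callers must handle that case rather than assume a tool exists.

```python
# Sketch under stated assumptions: provider selection follows
# settings.web_search_provider as described in the factory docstring.
import asyncio

from src.tools.web_search_factory import create_web_search_tool

async def demo() -> None:
    tool = create_web_search_tool()
    if tool is None:
        print("No web search provider configured")
        return
    evidence = await tool.search("serper.dev google search api", max_results=5)
    print(f"{tool.name}: {len(evidence)} results")

asyncio.run(demo())
```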
src/utils/llm_factory.py CHANGED
@@ -132,34 +132,22 @@ def get_pydantic_ai_model(oauth_token: str | None = None) -> Any:
132
  Returns:
133
  Configured pydantic-ai model
134
  """
135
- from pydantic_ai.models.anthropic import AnthropicModel
136
  from pydantic_ai.models.huggingface import HuggingFaceModel
137
- from pydantic_ai.models.openai import OpenAIChatModel as OpenAIModel
138
- from pydantic_ai.providers.anthropic import AnthropicProvider
139
  from pydantic_ai.providers.huggingface import HuggingFaceProvider
140
- from pydantic_ai.providers.openai import OpenAIProvider
141
 
142
- # Priority: oauth_token > env vars
143
  effective_hf_token = oauth_token or settings.hf_token or settings.huggingface_api_key
144
 
145
- if settings.llm_provider == "huggingface":
146
- model_name = settings.huggingface_model or "meta-llama/Llama-3.1-8B-Instruct"
147
- hf_provider = HuggingFaceProvider(api_key=effective_hf_token)
148
- return HuggingFaceModel(model_name, provider=hf_provider)
149
-
150
- if settings.llm_provider == "openai":
151
- if not settings.openai_api_key:
152
- raise ConfigurationError("OPENAI_API_KEY not set for pydantic-ai")
153
- provider = OpenAIProvider(api_key=settings.openai_api_key)
154
- return OpenAIModel(settings.openai_model, provider=provider)
155
-
156
- if settings.llm_provider == "anthropic":
157
- if not settings.anthropic_api_key:
158
- raise ConfigurationError("ANTHROPIC_API_KEY not set for pydantic-ai")
159
- anthropic_provider = AnthropicProvider(api_key=settings.anthropic_api_key)
160
- return AnthropicModel(settings.anthropic_model, provider=anthropic_provider)
161
 
162
- # Default to HuggingFace if provider is unknown or not specified
163
  model_name = settings.huggingface_model or "meta-llama/Llama-3.1-8B-Instruct"
164
  hf_provider = HuggingFaceProvider(api_key=effective_hf_token)
165
  return HuggingFaceModel(model_name, provider=hf_provider)
 
132
  Returns:
133
  Configured pydantic-ai model
134
  """
 
135
  from pydantic_ai.models.huggingface import HuggingFaceModel
 
 
136
  from pydantic_ai.providers.huggingface import HuggingFaceProvider
 
137
 
138
+ # Priority: oauth_token > settings.hf_token > settings.huggingface_api_key
139
  effective_hf_token = oauth_token or settings.hf_token or settings.huggingface_api_key
140
 
141
+ # HuggingFaceProvider requires a token - cannot use None
142
+ if not effective_hf_token:
143
+ raise ConfigurationError(
144
+ "HuggingFace token required. Please either:\n"
145
+ "1. Log in via HuggingFace OAuth (recommended for Spaces)\n"
146
+ "2. Set HF_TOKEN environment variable\n"
147
+ "3. Set huggingface_api_key in settings"
148
+ )
 
 
 
 
 
 
 
 
149
 
150
+ # Always use HuggingFace with available token
151
  model_name = settings.huggingface_model or "meta-llama/Llama-3.1-8B-Instruct"
152
  hf_provider = HuggingFaceProvider(api_key=effective_hf_token)
153
  return HuggingFaceModel(model_name, provider=hf_provider)
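With the simplification above, the factory is HuggingFace-only and a missing token now fails fast instead of constructing a provider with `api_key=None`. A hedged sketch of the expected behaviour; the `ConfigurationError` import path is taken from `src.utils.exceptions`, as elsewhere in this commit.

```python
# Sketch of the simplified, HuggingFace-only model factory.
import os

from src.utils.exceptions import ConfigurationError
from src.utils.llm_factory import get_pydantic_ai_model

try:
    model = get_pydantic_ai_model(oauth_token=os.getenv("HF_TOKEN"))
    print(type(model).__name__)  # HuggingFaceModel
except ConfigurationError as exc:
    print(f"LLM not configured: {exc}")
```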