# Changelog Note: version releases in the 0.x.y range may introduce breaking changes. ## 1.2.0 - minor: Add Drift Reduce response and streaming endpoint - minor: add cosmosdb vector store - patch: Fix example notebooks - patch: Set default rate limits. - patch: unit tests for text_splitting ## 1.2.0 - patch: Basic Rag minor fix ## 1.1.1 - patch: Fix a bug on creating community hierarchy for dynamic search - patch: Increase LOCAL_SEARCH_COMMUNITY_PROP to 15% ## 1.1.0 - minor: Make gleanings independent of encoding - minor: Remove DataShaper (first steps). - minor: Remove old pipeline runner. - minor: new search implemented as a new option for the api - patch: Fix gleanings loop check - patch: Implement cosmosdb storage option for cache and output - patch: Move extractor code to co-locate with operations. - patch: Remove config input models. - patch: Ruff update - patch: Simplify and streamline internal config. - patch: Simplify callbacks model. - patch: Streamline flows. - patch: fix instantiation of storage classes. ## 1.0.1 - patch: Fix encoding model config parsing - patch: Fix exception on error callbacks - patch: Manage llm instances inside a cached singleton. Check for empty dfs after entity/relationship extraction - patch: Respect encoding_model option ## 1.0.0 - patch: Add Parent id to communities data model - patch: Add migration notebook. - patch: Create separate community workflow, collapse subflows. - patch: Dependency Updates - patch: cleanup and refactor factory classes. ## 0.9.0 - minor: Refactor graph creation. - patch: Dependency updates - patch: Fix Global Search with dynamic Community selection bug - patch: Fix question gen. - patch: Optimize Final Community Reports calculation and stabilize cache - patch: miscellaneous code cleanup and minor changes for better alignment of style across the codebase. - patch: replace llm package with fnllm - patch: replaced md5 hash with sha256 - patch: replaced md5 hash with sha512 - patch: update API and add a demonstration notebook ## 0.5.0 - minor: Data model changes. - patch: Add Parquet as part of the default emitters when not pressent - patch: Centralized prompts and export all for easier injection. - patch: Cleanup of artifact outputs/schemas. - patch: Config and docs updates. - patch: Implement dynamic community selection to global search - patch: fix autocompletion of existing files/directory paths. - patch: move import statements out of init files ## 0.4.1 - patch: Add update cli entrypoint for incremental indexing - patch: Allow some CI/CD jobs to skip PRs dedicated to doc updates only. - patch: Fix a file paths issue in the viz guide. - patch: Fix optional covariates update in incremental indexing - patch: Raise error on empty deltas for inc indexing - patch: add visualization guide to doc site - patch: fix streaming output error ## 0.4.0 - minor: Add Incremental Indexing - minor: Added DRIFT graph reasoning query module - minor: embeddings moved to a different workflow - patch: Add DRIFT search cli and example notebook - patch: Add config for incremental updates - patch: Add embeddings to subflow. - patch: Add naive community merge using time period - patch: Add relationship merge - patch: Add runtime-only storage option. - patch: Add text units update - patch: Allow empty workflow returns to avoid disk writing. - patch: Apply pandas optimizations to create final entities - patch: Calculate new inputs and deleted inputs on update - patch: Collapse covariates flow. - patch: Collapse create-base-entity-graph. - patch: Collapse create-final-community-reports. - patch: Collapse create-final-documents. - patch: Collapse create-final-entities. - patch: Collapse create-final-nodes. - patch: Collapse create_base_documents. - patch: Collapse create_base_text_units. - patch: Collapse create_final_relationships. - patch: Collapse entity extraction. - patch: Collapse entity summarize. - patch: Collapse intermediate workflow outputs. - patch: Dependency updates - patch: Extract DataShaper-less flows. - patch: Fix Community ID loading for DRIFT search over existing indexes - patch: Fix embeddings faulty assignments - patch: Fix init defaults for vector store and drift img in docs - patch: Fix nested json parsing - patch: Fix some edge cases on Drift Search over small input sets - patch: Fix var name for embedding - patch: Merge existing and new entities, updating values accordingly - patch: Merge text_embed into create-final-relationships subflow. - patch: Move embedding verbs to operations. - patch: Moving verbs around. - patch: Optimize Create Base Documents subflow - patch: Optimize text unit relationship count - patch: Perf optimizations in map_query_to_entities() - patch: Remove aggregate_df from final coomunities and final text units - patch: Remove duplicated relationships and nodes - patch: Remove unused column from final entities - patch: Reorganized api,reporter,callback code into separate components. Defined debug profiles. - patch: Small cleanup in community context history building - patch: Transient entity graph and snapshotting. - patch: Update Incremental Indexing to new embeddings workflow - patch: Use mkdocs for documentation - patch: add backwards compatibility patch to vector store. - patch: add-autogenerated-cli-docs - patch: fix docs image path - patch: refactor use of vector stores and update support for managed identity - patch: remove redundant error-handling code from global-search - patch: reorganize cli layer ## 0.3.6 - patch: Collapse create_final_relationships. - patch: Dependency update and cleanup ## 0.3.5 - patch: Add compound verbs with tests infra. - patch: Collapse create_final_communities. - patch: Collapse create_final_text_units. - patch: Covariate verb collapse. - patch: Fix duplicates in community context builder - patch: Fix prompt tune output path - patch: Fix seed hardcoded init - patch: Fix seeded random gen on clustering - patch: Improve logging. - patch: Set default values for cli parameters. - patch: Use static output directories. ## 0.3.4 - patch: Deep copy txt units on local search to avoid race conditions - patch: Fix summarization including empty descriptions ## 0.3.3 - patch: Add entrypoints for incremental indexing - patch: Clean up and organize run index code - patch: Consistent config loading. Resolves #99 and Resolves #1049 - patch: Fix circular dependency when running prompt tune api directly - patch: Fix default settings for embedding - patch: Fix img for auto tune - patch: Fix img width - patch: Fixed a bug in prompt tuning process - patch: Refactor text unit build at local search - patch: Update Prompt Tuning docs - patch: Update create_pipeline_config.py - patch: Update prompt tune command in docs - patch: add querying from azure blob storage - patch: fix setting base_dir to full paths when not using file system. - patch: fix strategy config in entity_extraction ## 0.3.2 - patch: Add context data to query API responses. - patch: Add missing config parameter documentation for prompt tuning - patch: Add neo4j community notebook - patch: Ensure entity types to be str when running prompt tuning - patch: Fix weight casting during graph extraction - patch: Patch "past" dependency issues - patch: Update developer guide. - patch: Update query type hints. - patch: change-lancedb-placement ## 0.3.1 - patch: Add preflight check to check LLM connectivity. - patch: Add streaming support for local/global search to query cli - patch: Add support for both float and int on schema validation for community report generation - patch: Avoid running index on gh-pages publishing - patch: Implement Index API - patch: Improves filtering for data dir inferring - patch: Update to nltk 3.9.1 ## 0.3.0 - minor: Implement auto templating API. - minor: Implement query engine API. - patch: Fix file dumps using json for non ASCII chars - patch: Stabilize smoke tests for query context building - patch: fix query embedding - patch: fix sort_context & max_tokens params in verb ## 0.2.2 - patch: Add a check if there is no community record added in local search context - patch: Add sepparate workflow for Python Tests - patch: Docs updates - patch: Run smoke tests on 4o ## 0.2.1 - patch: Added default columns for vector store at create_pipeline_config. No change for other cases. - patch: Change json parsing error in the map step of global search to warning - patch: Fix Local Search breaking when loading Embeddings input. Defaulting overwrite to True as in the rest of the vector store config - patch: Fix json parsing when LLM returns faulty responses - patch: Fix missing community reports and refactor community context builder - patch: Fixed a bug that erased the vector database, added a new parameter to specify the config file path, and updated the documentation accordingly. - patch: Try parsing json before even repairing - patch: Update Prompt Tuning meta prompts with finer examples - patch: Update default entity extraction and gleaning prompts to reduce hallucinations - patch: add encoding-model to entity/claim extraction config - patch: add encoding-model to text chunking config - patch: add user prompt to history-tracking llm - patch: update config reader to allow for zero gleans - patch: update config-reader to allow for empty chunk-by arrays - patch: update history-tracking LLm to use 'assistant' instead of 'system' in output history. - patch: use history argument in hash key computation; add history input to cache data ## 0.2.0 - minor: Add content-based KNN for selecting prompt tune few shot examples - minor: Add dynamic community report rating to the prompt tuning engine - patch: Add Minute-based Rate Limiting and fix rpm, tpm settings - patch: Add N parameter support - patch: Add cli flag to overlay default values onto a provided config. - patch: Add exception handling on file load - patch: Add language support to prompt tuning - patch: Add llm params to local and global search - patch: Fix broken prompt tuning link on docs - patch: Fix delta none on query calls - patch: Fix docsite base url - patch: Fix encoding model parameter on prompt tune - patch: Fix for --limit exceeding the dataframe length - patch: Fix for Ruff 0.5.2 - patch: Fixed an issue where base OpenAI embeddings can't work with Azure OpenAI LLM - patch: Modify defaults for CHUNK_SIZE, CHUNK_OVERLAP and GLEANINGS to reduce time and LLM calls - patch: fix community_report doesn't work in settings.yaml - patch: fix llm response content is None in query - patch: fix the organization parameter is ineffective during queries - patch: remove duplicate file read - patch: support non-open ai model config to prompt tune - patch: use binary io processing for all file io operations ## 0.1.0 - minor: Initial Release