Nathan Evans
978e79875e
Remove file filtering ( #2050 )
...
* Remove document filtering
* Semver
* Fix integ tests
* Fix file find tuple
* Fix another dangling find tuple
2025-09-09 15:36:25 -07:00
Nathan Evans
429e1b1f9e
Remove graph embedding and UMAP ( #2048 )
...
* Remove umap/layout operation
* Remove graph embedding
* Bump unified-search to GR 2.5.0
* Remove graph vis from unified-search
2025-09-09 15:35:43 -07:00
Copilot
e84df28e64
Improve internal logging functionality by using Python's standard logging module ( #1956 )
...
* Initial plan for issue
* Implement standard logging module and integrate with existing loggers
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Add test cases and improve documentation for standard logging
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Apply ruff formatting and add semversioner file for logging improvements
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Remove custom logger classes and refactor to use standard logging only
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Apply ruff formatting to resolve CI/CD test failures
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Add semversioner file and fix linting issues
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* ruff fixes
* fix spelling error
* Remove StandardProgressLogger and refactor to use standard logging
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Remove LoggerFactory and custom loggers, refactor to use standard logging
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Fix pyright error: use logger.info() instead of calling logger as function in cosmosdb_pipeline_storage.py
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* ruff fixes
* Remove deprecated logger files that were marked as deprecated placeholders
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Replace custom get_logger with standard Python logging
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Fix linting issues found by ruff check --fix
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* apply ruff check fixes
* add word to dictionary
* Fix type checker error in ModelManager.__new__ method
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Refactor multiple logging.getLogger() calls to use single logger per file
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Remove progress_logger parameter from build_index() and logger parameter from generate_indexing_prompts()
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Remove logger parameter from run_pipeline and standardize logger naming
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Replace logger parameter with log_level parameter in CLI commands
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Fix import ordering in notebook files to pass poetry poe check
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Remove --logger parameter from smoke test command
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Fix Windows CI/CD issue with log file cleanup in tests
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Add StreamHandler to root logger in __main__.py for CLI logging
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Only add StreamHandler if root logger doesn't have existing StreamHandler
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Fix import ordering in notebook files to pass ruff checks
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Replace logging.StreamHandler with colorlog.StreamHandler for colorized log output
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Regenerate poetry.lock file after adding colorlog dependency
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Fix import ordering in notebook files to pass ruff checks
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* move printing of dataframes to debug level
* remove colorlog for now
* Refactor workflow callbacks to inherit from logging.Handler
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Fix linting issues in workflow callback handlers
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Fix pyright type errors in blob and file workflow callbacks
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Refactor pipeline logging to use pure logging.Handler subclasses
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Rename workflow callback classes to workflow logger classes and move to logger directory
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* update dictionary
* apply ruff fixes
* fix function name
* simplify logger code
* update
* Remove error, warning, and log methods from WorkflowCallbacks and replace with standard logging
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* ruff fixes
* Fix pyright errors by removing WorkflowCallbacks from strategy type signatures
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Remove ConsoleWorkflowLogger and apply consistent formatter to all handlers
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* apply ruff fixes
* Refactor pipeline_logger.py to use standard FileHandler and remove FileWorkflowLogger
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Remove conditional azure import checks from blob_workflow_logger.py
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Fix pyright type checking errors in mock_provider.py and utils.py
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Run ruff check --fix to fix import ordering in notebooks
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Merge configure_logging and create_pipeline_logger into init_loggers function
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* Remove configure_logging and create_pipeline_logger functions, replace all usage with init_loggers
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* apply ruff fixes
* cleanup unused code
* Update init_loggers to accept GraphRagConfig instead of ReportingConfig
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* apply ruff check fixes
* Fix test failures by providing valid GraphRagConfig with required model configurations
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* apply ruff fixes
* remove logging_workflow_callback
* cleanup logging messages
* Add logging to track progress of pandas DataFrame apply operation in create_base_text_units
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* cleanup logger logic throughout codebase
* update
* more cleanup of old loggers
* small logger cleanup
* final code cleanup and added loggers to query
* add verbose logging to query
* minor code cleanup
* Fix broken unit tests for chunk_text and standard_logging
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* apply ruff fixes
* Fix test_chunk_text by mocking progress_ticker function instead of ProgressTicker class
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
* remove unnecessary logger
* remove rich and fix type annotation
* revert test formatting changes made by copilot
* promote graphrag logs to root logger
* add correct semversioner file
* revert change to file
* revert formatting changes that have no effect
* fix changes after merge with main
* revert unnecessary copilot changes
* remove whitespace
* cleanup docstring
* simplify some logic with less code
* update poetry lock file
* ruff fixes
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: jgbradley1 <654554+jgbradley1@users.noreply.github.com>
Co-authored-by: Josh Bradley <joshbradley@microsoft.com>
2025-07-09 18:29:03 -06:00
Nathan Evans
27c6de846f
Update docs for 2.0+ ( #1984 )
...
* Update docs
* Fix prompt links
2025-06-23 13:49:47 -07:00
Nathan Evans
36948b8d2e
Various minor updates ( #1932 )
...
* Add text unit ids to Community model
* Add graph utilities
* Turn off LCC for clustering by default
* Simplify embeddings config/flow
* Semver
2025-05-16 14:48:53 -07:00
Nathan Evans
25bbae8642
Docs: Add models page ( #1842 )
...
* Add models page
* Update config docs for new params
* Spelling
* Add comment on CoT with o-series
* Add notes about managed identity
* Update the viz guide
* Spruce up the getting started wording
* Capitalization
* Add BYOG page
* More BYOG edits
* Update dictionary
* Change example model name
2025-04-28 17:36:08 -07:00
Nathan Evans
56e0fad218
NLP graph parity ( #1888 )
...
* Update stopwords config
* Minor edits
* Update PMI
* Format
* Perf improvements
* Semver
* Remove edge collection apply
* Remove source/target apply
* Add edge weight to graph snapshot
* Revert breaking optimizations
* Add perf fixes back in
* Format/types
* Update defaults
* Fix source/target ordering
* Fix test
2025-04-25 17:09:06 -06:00
Nathan Evans
3b1e70c06b
Update config docs (2.1.0) ( #1818 )
...
* Align docs with config
* Semver
* Spelling
* Format
* Spelling
2025-03-18 12:39:30 -07:00
Nathan Evans
bcb74789f1
Next release docs ( #1627 )
...
* Wording updates
* Update yaml config and add notes to "deprecated" env
* Add basic search section
* Update versioning docs
* Minor edits for clarity
* Update init command
* Update init to add --force in docs
* Add NLP extraction params
* Move vector_store to root
* Add workflows to config
* Add FastGraphRAG docs
* add metadata column changes
* Added documentation for multi index search.
* Minor fixes.
* Add config and table renames
* Update migration notebook and comments to specify v1
* Add frequency to entity table docs
* add new chunking options for metadata
* Update output docs
* Minor edits and cleanup
* Add model ids to search configs
* Spruce up migration notebook
* Lint/format multi-index notebook
* SpaCy model note
* Update SpaCy footnote
* Updated multi_index_search.ipynb to remove ruff errors.
* add spacy to dictionary
---------
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
Co-authored-by: Dayenne Souza <ddesouza@microsoft.com>
Co-authored-by: dorbaker <dorbaker@microsoft.com>
2025-03-03 14:46:00 -08:00
Nathan Evans
d17dfd01f9
Graph collapse ( #1464 )
...
* Refactor graph creation
* Semver
* Spellcheck
* Update integ pipeline
* Fix cast
* Improve pandas chaining
* Cleaner apply
* Use list comprehensions
---------
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-12-05 11:57:26 -06:00
Josh Bradley
dad2176b3c
Miscellaneous code cleanup procedures ( #1452 )
2024-11-27 13:27:43 -05:00
Nathan Evans
9b4f24ebce
First cut at config cleanup ( #1411 )
...
* First cut at config cleanup
* Reorder top nav
* Add query prompts to tuning page
* Remove dynamic notebook from nav
* Add more thorough yml config descriptions in docs
* Further clean out the config
* Semver
* Add new blog post
* Emphasize yaml
* Clarify output
* Fix unit test
* Fix bullet nesting
2024-11-15 14:33:26 -08:00