A modular graph-based Retrieval-Augmented Generation (RAG) system
Go to file
2024-07-26 14:38:44 -07:00
.github Update issue templates for more explicit guidance (#738) 2024-07-26 14:52:14 -04:00
.semversioner Update the ConfigReader to allow for empty chunk-by arrays (#742) 2024-07-26 14:38:44 -07:00
.vscode Initial Release 2024-07-01 15:25:30 -06:00
docsite add encoding model to text-chunking config (#743) 2024-07-26 14:15:17 -07:00
examples Modify the URL comment to make it correct. (#482) 2024-07-11 16:19:54 -06:00
examples_notebooks add community sample with interactive graph visualization (#569) 2024-07-16 17:55:39 -06:00
graphrag Update the ConfigReader to allow for empty chunk-by arrays (#742) 2024-07-26 14:38:44 -07:00
scripts Initial Release 2024-07-01 15:25:30 -06:00
tests add encoding model to text-chunking config (#743) 2024-07-26 14:15:17 -07:00
.gitignore Fix/feat: Implementation of Minute-Based Rate Limiting in CommunityReportsExtractor Using asyncio and async_mode (#373) 2024-07-05 13:41:11 -06:00
.vsts-ci.yml Initial Release 2024-07-01 15:25:30 -06:00
CHANGELOG.md v0.2.0 (#700) 2024-07-24 23:02:03 -04:00
CODE_OF_CONDUCT.md Initial Release 2024-07-01 15:25:30 -06:00
CODEOWNERS Update CODEOWNERS (#658) 2024-07-22 16:16:32 -06:00
CONTRIBUTING.md Initial Release 2024-07-01 15:25:30 -06:00
cspell.config.yaml Initial Release 2024-07-01 15:25:30 -06:00
DEVELOPING.md Initial Release 2024-07-01 15:25:30 -06:00
dictionary.txt Update dictionary.txt (#350) 2024-07-03 10:01:18 -06:00
LICENSE Initial Release 2024-07-01 15:25:30 -06:00
poetry.lock Read/write files as binary utf-8 (#639) 2024-07-24 13:28:22 -04:00
pyproject.toml v0.2.0 (#700) 2024-07-24 23:02:03 -04:00
RAI_TRANSPARENCY.md Initial Release 2024-07-01 15:25:30 -06:00
README.md Update README.md (#439) 2024-07-08 17:49:15 -06:00
SECURITY.md Initial Release 2024-07-01 15:25:30 -06:00
SUPPORT.md Initial Release 2024-07-01 15:25:30 -06:00

GraphRAG

👉 Use the GraphRAG Accelerator solution
👉 Microsoft Research Blog Post
👉 Read the docs
👉 GraphRAG Arxiv

Overview

The GraphRAG project is a data pipeline and transformation suite that is designed to extract meaningful, structured data from unstructured text using the power of LLMs.

To learn more about GraphRAG and how it can be used to enhance your LLMs ability to reason about your private data, please visit the Microsoft Research Blog Post.

Quickstart

To get started with the GraphRAG system we recommend trying the Solution Accelerator package. This provides a user-friendly end-to-end experience with Azure resources.

Repository Guidance

This repository presents a methodology for using knowledge graph memory structures to enhance LLM outputs. Please note that the provided code serves as a demonstration and is not an officially supported Microsoft offering.

⚠️ Warning: GraphRAG indexing can be an expensive operation, please read all of the documentation to understand the process and costs involved, and start small.

Diving Deeper

Prompt Tuning

Using GraphRAG with your data out of the box may not yield the best possible results. We strongly recommend to fine-tune your prompts following the Prompt Tuning Guide in our documentation.

Responsible AI FAQ

See RAI_TRANSPARENCY.md

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

Privacy

Microsoft Privacy Statement