graphrag/tests/fixtures/min-csv/settings.yml
Derek Worthen 3b09df6e07
Migrate towards using static output directories (#1113)
* Migrate towards using static output directories

- Fixes load_config eagering resolving directories.
    Directories are only resolved when the output
    directories are local.
- Add support for `--output` and `--reporting` flags
    for index CLI. To achieve previous output structure
    `index --output run1/artifacts --reports run1/reports`.
- Use static output directories when initializing
    a new project.
- Maintains backward compatibility for those using
    timestamp outputs locally.

* fix smoke tests

* update query cli to work with static directories

* remove eager path resolution from load_config. Support CLI overrides that can be resolved.

* add docs and output logs/artifacts to same directory

* use match statement

* switch back to if statement

---------

Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-09-18 17:36:50 -06:00

33 lines
877 B
YAML

input:
file_type: csv
embeddings:
vector_store:
type: "lancedb"
uri_db: "./tests/fixtures/min-csv/lancedb"
store_in_table: True
entity_name_description:
title_column: "name"
# id_column: "id"
# overwrite: true
# entity_name: ...
# relationship_description: ...
# community_report_full_content: ...
# community_report_summary: ...
# community_report_title: ...
# document_raw_content: ...
# text_unit_text: ...
storage:
type: file # or blob
base_dir: "output/${timestamp}/artifacts"
# connection_string: <azure_blob_storage_connection_string>
# container_name: <azure_blob_storage_container_name>
reporting:
type: file # or console, blob
base_dir: "output/${timestamp}/reports"
# connection_string: <azure_blob_storage_connection_string>
# container_name: <azure_blob_storage_container_name>