## Cookbook for using ChromaDB with Embedchain

### Step-1: Install the embedchain package

In [None]:
!pip install embedchain[dataloaders]

### Step-2: Set the OpenAI API key environment variable

You can find your API key on your [OpenAI dashboard](https://platform.openai.com/account/api-keys).

In [None]:
import os
from embedchain import Pipeline as App

os.environ["OPENAI_API_KEY"] = "sk-xxx"
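
If you would rather not hard-code the key in the notebook, the sketch below shows one possible alternative: reuse `OPENAI_API_KEY` if it is already set, and prompt for it otherwise. The helper name `ensure_openai_key` is hypothetical and not part of embedchain; it only relies on the standard-library `os` and `getpass` modules.

```python
import os
from getpass import getpass


def ensure_openai_key(env=os.environ, prompt=getpass):
    """Return the OpenAI API key, prompting only when it is not already set.

    `ensure_openai_key` is a hypothetical helper for this cookbook,
    not an embedchain function.
    """
    key = env.get("OPENAI_API_KEY")
    if not key:
        # getpass hides the typed value so the key does not appear in the cell output
        key = prompt("OpenAI API key: ")
        env["OPENAI_API_KEY"] = key
    return key
```

Calling `ensure_openai_key()` in place of the assignment above should leave `os.environ["OPENAI_API_KEY"]` populated for the later steps.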

### Step-3: Define your Vector Database config

In [None]:
config = """
vectordb:
  provider: chroma
  config:
    collection_name: 'my-collection'
    # CHANGE THE HOST AND PORT BELOW!
    # They must point to your remote ChromaDB server
    host: your-chromadb-url.com
    port: 5200
    allow_reset: true
"""

# Write the multi-line string to a YAML file
with open('chromadb.yaml', 'w') as file:
    file.write(config)
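
Since the host and port are the values you are expected to change, it may be convenient to generate the config string from parameters instead of editing it by hand. The following is a minimal sketch under that assumption: `build_chroma_config` is a hypothetical helper, and `localhost` and `8000` in the usage note are placeholder values, not defaults taken from this cookbook.

```python
def build_chroma_config(host, port, collection_name="my-collection"):
    """Return a YAML string with the same keys as the config above.

    Hypothetical helper for illustration; not part of embedchain.
    """
    return (
        "vectordb:\n"
        "  provider: chroma\n"
        "  config:\n"
        f"    collection_name: '{collection_name}'\n"
        f"    host: {host}\n"
        # int() rejects non-numeric ports early, before the file is written
        f"    port: {int(port)}\n"
        "    allow_reset: true\n"
    )
```

For example, `build_chroma_config("localhost", 8000)` returns a string that can be written to `chromadb.yaml` exactly as in the cell above.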

### Step-4: Create embedchain app based on the config

In [None]:
app = App.from_config(config_path="chromadb.yaml")

### Step-5: Add data sources to your app

In [None]:
app.add("https://www.forbes.com/profile/elon-musk")

### Step-6: All set. Now start asking questions related to your data

In [None]:
# Keep answering questions until the user types q, exit, or quit
while True:
    question = input("Enter question: ")
    if question in ['q', 'exit', 'quit']:
        break
    answer = app.query(question)
    print(answer)
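
The interactive loop works well in a notebook, but a non-interactive variant can be handy for scripts. The sketch below assumes only that the app object exposes `query(question)` as used above; the helper name `ask_all` is hypothetical.

```python
def ask_all(app, questions):
    """Query the app once per non-empty question.

    Returns a list of (question, answer) pairs.
    Hypothetical helper; it only calls app.query, as in the loop above.
    """
    results = []
    for question in questions:
        question = question.strip()
        if not question:
            # Skip blank entries instead of sending an empty query
            continue
        results.append((question, app.query(question)))
    return results
```

You could then pass it the `app` created in Step-4 together with a list of question strings, and print or store the returned pairs.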