Working with Object Stores

The Object Stores section of the Data Catalog lets you register your object storage services (such as Amazon S3, Azure Blob Storage, and Google Cloud Storage) so that you can manage and use your unstructured data within Kosmoy Studio.

Prerequisites:

Before registering an Object Store, you must have an active Integration with the corresponding cloud provider (AWS, Azure, or Google). Refer to the Managing Integrations section for details.

Registering an Object Store

  1. Navigate to Object Stores: From the Kosmoy Studio home page, open the “Data Management” menu in the left-hand navigation bar, select “Data Catalog”, then click “Object Stores”.

  2. Add a New Object Store: Click the “+ ADD” button located in the upper right corner of the Object Stores section.

  3. Select Provider: Choose the cloud provider of your object store (e.g., Amazon, Azure, Google).

  4. Select Integration: From the dropdown menu, select the appropriate pre-configured Integration for your cloud provider.

  5. Select Object Store: Choose the specific object store (e.g., S3 bucket, Blob container) you want to register. The available options will be dynamically listed based on your selected Integration.

  6. Click “Next”.

  7. Name and Describe the Object Store: Give your object store a unique name and an optional description.

  8. Click “Review”: Review the object store details.

  9. Click “Create”: Register the object store in the Data Catalog.

Object Store Cards

The Object Stores section displays each registered object store as a card. Each card shows:

  • Object Store Provider Icon: An icon representing the cloud provider (e.g., AWS, Azure, Google).
  • Object Store Name: The name you assigned to the object store.
  • Description: The description you provided for the object store.
  • Edit Icon (Pencil): Click this icon to update the object store’s name, description, or configuration (only if the object store is not in use).
  • Delete Icon (Trash Bin): Click this icon to remove the object store (only if the object store is not in use).

Object Store Usage Restrictions

You cannot edit or delete an Object Store if it is currently referenced by other entities within Kosmoy Studio. This includes being used in:

  • Folders
  • Vector Pipelines
  • Any other Kosmoy Studio component that references Object Stores.

Before attempting to edit or delete an Object Store, ensure it is not actively used in any of these areas.
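The restriction above can be pictured as a simple guard: an edit or delete goes through only when nothing references the Object Store. The following is a minimal conceptual sketch, not Kosmoy Studio's API or implementation. Every name in it (Catalog, ObjectStore, ObjectStoreInUseError, referenced_by) is hypothetical and exists only to illustrate the rule.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


class ObjectStoreInUseError(Exception):
    """Hypothetical error: the store is still referenced by another component."""


@dataclass
class ObjectStore:
    name: str
    provider: str
    description: str = ""
    # Hypothetical bookkeeping: identifiers of the Folders, Vector Pipelines,
    # or other components that currently reference this store.
    referenced_by: Set[str] = field(default_factory=set)

    @property
    def in_use(self) -> bool:
        return bool(self.referenced_by)


class Catalog:
    """Toy in-memory catalog that enforces the edit/delete restriction."""

    def __init__(self) -> None:
        self._stores: Dict[str, ObjectStore] = {}

    def register(self, store: ObjectStore) -> None:
        # Object store names must be unique within the catalog.
        if store.name in self._stores:
            raise ValueError(f"An object store named {store.name!r} already exists")
        self._stores[store.name] = store

    def get(self, name: str) -> ObjectStore:
        return self._stores[name]

    def _require_unused(self, name: str) -> ObjectStore:
        # Shared guard: refuse to continue while any reference remains.
        store = self._stores[name]
        if store.in_use:
            users = ", ".join(sorted(store.referenced_by))
            raise ObjectStoreInUseError(f"{name!r} is in use by: {users}")
        return store

    def update_description(self, name: str, description: str) -> None:
        self._require_unused(name).description = description

    def delete(self, name: str) -> None:
        self._require_unused(name)
        del self._stores[name]
```

In this sketch, a store with an empty referenced_by set can be edited or deleted freely. As soon as a component identifier is added to the set, both operations raise ObjectStoreInUseError until the reference is removed, which mirrors the order of work the restriction implies: detach the Object Store from its Folders and Vector Pipelines first, then edit or delete it.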

Updating an Object Store

You can update a registered Object Store’s name, description, and the underlying object store it points to, provided no other component currently references it.

  1. Navigate to Object Stores: Go to “Data Management” > “Data Catalog” > “Object Stores”.
  2. Locate the Object Store Card: Find the card for the Object Store you want to update.
  3. Click the Edit (Pencil) Icon: This will open the update dialog.
  4. Modify Object Store Parameters: Change the name or description, or re-select the object store from the list provided by the Integration.
  5. Click “Save”: Save the changes.

If you attempt to edit an Object Store that is currently in use, a warning banner is displayed at the top of the screen and the modification is blocked.

Removing an Object Store

You can remove a registered Object Store if it’s no longer needed. However, you cannot delete an Object Store that is currently referenced by any other component.

  1. Navigate to Object Stores: Go to “Data Management” > “Data Catalog” > “Object Stores”.
  2. Locate the Object Store Card: Find the card for the Object Store you want to remove.
  3. Click the Delete (Trash Bin) Icon: This will trigger a confirmation prompt.
  4. Confirm Deletion: Confirm that you want to delete the Object Store.

If you attempt to delete an Object Store that is currently in use, a modal will appear, preventing the deletion and explaining that the Object Store is in use.

Warning: Deleting an Object Store is a permanent action and cannot be undone. Ensure that the Object Store is not being referenced by any other component before proceeding.

Using Object Stores

Registered Object Stores are primarily used when configuring Vector Pipelines in the Data Ingestion section. You can select an Object Store and a specific Folder within it as the source for your unstructured data (PDF and Office files) that will be processed by the pipeline.