Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add vertex feature store integration #250

Closed
wants to merge 6 commits into from

Conversation

eliasecchig
Copy link
Contributor

Authors: @lspataroG @eliasecchig

Add VertexAI Langchain integration

Example usage

@lkuligin
Copy link
Collaborator

/gcbrun

1 similar comment
@eliasecchig
Copy link
Contributor Author

/gcbrun

@eliasecchig eliasecchig force-pushed the add-fs branch 3 times, most recently from 127d024 to b96cecf Compare May 27, 2024 11:44
@lkuligin
Copy link
Collaborator

/gcbrun

@lspataroG
Copy link
Contributor

/gcbrun

1 similar comment
@lkuligin
Copy link
Collaborator

/gcbrun

@gmogr
Copy link
Collaborator

gmogr commented May 28, 2024

In integration tests could you please add a check for already created datasets in BQ.
tests/integration_tests/vectorstores/feature_store/test_feature_store_fs_executor.py are failing if BQ dataset is already created.

@eliasecchig
Copy link
Contributor Author

@gmogr can you link an example where tests failed?
If dataset already exists, the class should not complain about it, see here

@lspataroG
Copy link
Contributor

@gmogr Added teardown for feature view.

@eliasecchig
Copy link
Contributor Author

/gcbrun

@gmogr
Copy link
Collaborator

gmogr commented May 28, 2024

/gcbrun

gca_resource = self._online_store.gca_resource
endpoint = gca_resource.dedicated_serving_endpoint.public_endpoint_domain_name
return FeatureOnlineStoreServiceClient(
client_options={"api_endpoint": endpoint}
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it looks like telemetry is missing here too

self._bq_client = bigquery.Client(
project=self.project_id,
location=self.location,
credentials=self.credentials,
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same here, client_info is not set

authentication.

To use, you need the following packages installed:
google-cloud-bigquery
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we hint a specific langchain-google-vertexai dep group instead, please?


def sync(self):
"""Sync the data from the Big Query source into the Executor source"""
self._df = self._query_table_to_df()
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we at least add a warning if the amount of records it too large for an in-memory bfs, please?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1

self._env_config = env_config


class BruteForceExecutor(BaseExecutor):
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why do we need our own in-memory implementation if LC already has a few? can we just write an adapter that will load data into it?

The FeatureStore aims to facilitate similarity search using different
methodologies on Google Cloud including Big Query, Feature Store and a
local bruteforce search engine.
Big Query is the data source of truth and also the default search
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nit: BigQuery one word instead of Big Query

@lkuligin lkuligin closed this Jun 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

6 participants