You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
For implemetation of remote::nvidia datasetio, the DatasetsRoutingTable does not allow nvidia provider due to below logic: The provider_id get's set to localfs instead.
The issue is being caused by:
# infer provider from sourceifsource.type==DatasetType.rows.value:
provider_id="localfs"elifsource.type==DatasetType.uri.value:
# infer provider from uriifsource.uri.startswith("huggingface"):
provider_id="huggingface"else:
provider_id="localfs"else:
raiseValueError(f"Unknown data source type: {source.type}")
fromllama_stack.distribution.library_clientimportLlamaStackAsLibraryClientimportosos.environ["NVIDIA_PROJECT_ID"] ="experment@1"client=LlamaStackAsLibraryClient("nvidia")
_=client.initialize()
client.datasets.register(
purpose="post-training/messages",
dataset_id="sample-basic-test",
source={
"type": "uri",
"uri": "hf://datasets/default/sample-basic-test"
},
metadata={"format": "json", "description": "This is an example of a dataset"},
)
result==>DatasetRegisterResponse(identifier='sample-basic-test', metadata={'format': 'json', 'description': 'This is an example of a dataset'}, provider_id='localfs', provider_resource_id='sample-basic-test', purpose='post-training/messages', source=SourceUriDataSource(type='uri', uri='hf://datasets/default/sample-basic-test'), type='dataset', access_attributes=None)
provider_id='localfs' which does blocks redirection to the NvidiaDatasetIOAdapter, hence
Expected behavior
Allowing provider_id as an argument to decide where to register dataset.
The text was updated successfully, but these errors were encountered:
Information
🐛 Describe the bug
For implemetation of remote::nvidia datasetio, the DatasetsRoutingTable does not allow nvidia provider due to below logic: The provider_id get's set to
localfs
instead.The issue is being caused by:
https://github.com/meta-llama/llama-stack/blob/main/llama_stack/distribution/routers/routing_tables.py#L434
Error logs
provider_id='localfs'
which does blocks redirection to theNvidiaDatasetIOAdapter
, henceExpected behavior
Allowing provider_id as an argument to decide where to register dataset.
The text was updated successfully, but these errors were encountered: