Hi @Priya Yadav
Thanks for bringing this up. You’re right - Azure Purview relies on the entity_run_id
in the system.access.table_lineage
table to properly trace data lineage from Databricks notebooks.
If entity_run_id
is NULL
, Purview won't be able to associate that lineage information with a specific run, which means the lineage graph in Purview may be incomplete or not show up at all. This is a known limitation when using dbutils.notebook.run
, as it doesn’t always pass the execution context fully, especially in chained notebook scenarios.
To capture complete lineage, it's generally recommended to use Databricks Jobs with task-based orchestration instead. This ensures that the lineage metadata is recorded properly, including the entity_run_id
, and improves the visibility in Purview.
Hope this helps clarify things - happy to dive deeper into your setup if needed.
Kindly consider upvoting the comment if the information provided is helpful. This can assist other community members in resolving similar issues.