• IEEE.org
  • IEEE CS Standards
  • Career Center
  • About Us
  • Subscribe to Newsletter

0

IEEEWhat are some of the key standards and programs for measuring and reporting on Sustainability
  •  Acceleration through Open Source projects
  •  An overview of Egeria and it can jumpstart sustainability projects
  •  A deeper look at the Greenhouse Gas Protocol and Carbon Accounting showing how Egeria can help organizations establish and deliver trusted sustainability reports
  • Open Lineage for Data Trust and UnderstandingOne of the most requested metadata use cases is lineage. This is the ability to understand the origin of your data and the processing (reformatting, enrichment, merging, ...) it has gone through between the data's origin and your AI model. Lineage helps to build trust in your model since it shows you have used appropriate data. Many individual technologies provide some lineage support that covers its own processing. Some data catalogs provide proprietary ways to gather lineage from many sources. However this is expensive to implement and only makes the lineage information available through the data catalog. Now three open source projects from LF AI and Data have come together to create a truely open ecosystem for lineage. Egeria provides open metadata that describes the data sources, data structures, data profiling results and the data pipelines. OpenLinege provides the event mechanism that records each time a data pipeline runs. Marquez provides visualization for lineage. In this talk you will learn about:

    • What is lineage and how it is used
    •  What makes lineage difficult to collect and maintain
    •  How the open ecosystem for lineage works
    •  How you can use lineage in your data science tools (using Jupyter Notebooks as an example)
    LATEST NEWS