在线时间:8:00-16:00
迪恩网络APP
随时随地掌握行业动态
扫描二维码
关注迪恩网络微信公众号
开源软件名称(OpenSource Name):Building-ML-Pipelines/building-machine-learning-pipelines开源软件地址(OpenSource Url):https://github.com/Building-ML-Pipelines/building-machine-learning-pipelines开源编程语言(OpenSource Language):Jupyter Notebook 98.8%开源软件介绍(OpenSource Introduction):Building Machine Learning PipelinesCode repository for the O'Reilly publication "Building Machine Learning Pipelines" by Hannes Hapke & Catherine Nelson Update
Set up the demo projectDownload the initial dataset. From the root of this repository, execute
After this script runs, you should have a The datasetThe data that we use in this example project can be downloaded using the script above. The dataset is from a public dataset on customer complaints collected from the US Consumer Finance Protection Bureau. If you would like to reproduce our edited dataset, carry out the following steps:
Pre-pipeline experimentBefore building our TFX pipeline, we experimented with different feature engineering and model architectures. The notebooks in this folder preserve our experiments, and we then refactored our code into the interactive pipeline below. Interactive pipelineThe Full pipelines with Apache Beam, Apache Airflow, Kubeflow Pipelines, GCPThe ChaptersThe following subfolders contain stand-alone code for individual chapters. Model analysisChapter 7. Stand-alone code for TFMA, Fairness Indicators, What-If Tool. Note that these notebooks will not work in JupyterLab. Advanced TFXChapter 10. Notebook outlining the implementation of custom TFX components from scratch and by inheriting existing functionality. Presented at the Apache Beam Summit 2020. Data privacyChapter 14. Code for training a differentially private version of the demo project. Note that the TF-Privacy module only supports TF 1.x as of June 2020. Version notesThe code was written and tested for version 0.22.
|
2023-10-27
2022-08-15
2022-08-17
2022-09-23
2022-08-13
请发表评论