OpenSource Name: cyberdefendersprogram/MachineLearning
OpenSource Url: https://github.com/cyberdefendersprogram/MachineLearning
OpenSource Language: Jupyter Notebook 63.8%
OpenSource Introduction:

MachineLearning - DataSet Quality Research

This page will contain our progress in creating a report detailing the quality of different data sets. We have acquired permission from Mike Sconzo, owner of secrepo.com, to use his security datasets to analyze and report on the data.

Security Datasets for Machine Learning
by Tien Tran, Citlalin Galvan, Vivian Nguyen, Huy Nguyen

WHY FOCUS ON DATASETS?

Machine Learning is on the rise ⇑

A Machine Learning Algorithm can:
- Detect Suspicious Activity

The Problem: One critical problem for Machine Learning in Cyber Security is the limited amount of security data and the quality of the available training datasets. Without a good quality dataset, a Machine Learning Algorithm cannot learn properly.

Collecting the DataSets

Downloading SecRepo’s Datasets
- PE Malware Dataset: featureExtraction.py
- Network Dataset: Network_LogtoCSV.py
- Bro Logs Dataset: Brolog_LogtoCSV.py
- System Dataset: System_LogtoCSV.py, System_Squid_LogtoCSV.py

Analysis Reports

Detailing the data inside the Datasets with Jupyter Notebook

Elements in Data Quality Report:
- Data Type
- Count
- Unique Values
- Missing Values
- Minimum Values
- Maximum Values

Description Reports

Report Format
- Abstract
- Source
- Dataset Information
- Attribute Information
- Relevant Papers
- Associate Data Science Notebook
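
The elements listed under "Elements in Data Quality Report" (data type, count, unique values, missing values, minimum, maximum) could be computed per column roughly as follows. This is only a minimal sketch using the Python standard library, not the code from the project's notebooks, which may well use a different approach. The function name `data_quality_report`, the helper `_as_number`, the set of tokens treated as missing, and the sample rows are all hypothetical and chosen for illustration.

```python
import csv
import io


def _as_number(value):
    """Return value as a float, or None if it cannot be parsed as one."""
    try:
        return float(value)
    except ValueError:
        return None


def data_quality_report(rows, missing_tokens=("", "-", "NA")):
    """Summarise each column of an iterable of dict rows (e.g. csv.DictReader).

    missing_tokens is an assumption: which strings count as "missing"
    depends on how each dataset was exported.
    """
    # Collect the raw string values column by column.
    columns = {}
    for row in rows:
        for name, value in row.items():
            columns.setdefault(name, []).append(value)

    report = {}
    for name, values in columns.items():
        present = [
            v for v in values
            if v is not None and v.strip() not in missing_tokens
        ]
        numbers = [_as_number(v) for v in present]
        # A column is numeric only if every present value parses as a number.
        is_numeric = bool(present) and all(n is not None for n in numbers)
        comparable = numbers if is_numeric else present
        report[name] = {
            "data_type": "numeric" if is_numeric else "text",
            "count": len(present),
            "unique": len(set(present)),
            "missing": len(values) - len(present),
            "min": min(comparable) if comparable else None,
            "max": max(comparable) if comparable else None,
        }
    return report


# Hypothetical sample rows, not taken from the SecRepo datasets.
SAMPLE_CSV = """src_ip,dst_port,bytes
10.0.0.1,80,512
10.0.0.2,443,
10.0.0.1,80,2048
"""

report = data_quality_report(csv.DictReader(io.StringIO(SAMPLE_CSV)))
for column, stats in report.items():
    print(column, stats)
```

Text columns are compared as strings here, so their minimum and maximum are lexicographic; a real report would probably want type-aware handling for IP addresses and timestamps.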