在线时间:8:00-16:00
迪恩网络APP
随时随地掌握行业动态
扫描二维码
关注迪恩网络微信公众号
开源软件名称(OpenSource Name):scalanlp/nak开源软件地址(OpenSource Url):https://github.com/scalanlp/nak开源编程语言(OpenSource Language):Scala 72.4%开源软件介绍(OpenSource Introduction):NakNak is a Scala/Java library for machine learning and related tasks, with a focus on having an easy to use API for some standard algorithms. It is formed from Breeze, Liblinear Java, and Scalabha. It is currently undergoing a pretty massive evolution, so be prepared for quite big changes in the API for this and probably several future versions. We'd love to have some more contributors: if you are interested in helping out, please see the #helpwanted issues or suggest your own ideas. What's insideNak currently provides implementations for k-means clustering and supervised learning with logistic regression and support vector machines. Other models and algorithms that were formerly in [breeze.learn] are now in Nak. See the Nak wiki for (some preliminary and unfortunately sparse) documentation. The latest stable release of Nak is 1.2.1. Changes from the previous release include:
See the CHANGELOG for changes in previous versions. Using NakIn SBT:
In Maven:
ExampleHere's an example of how easy it is to train and evaluate a text classifier using Nak. See TwentyNewsGroups.scala for more details. def main(args: Array[String]) {
val newsgroupsDir = new File(args(0))
implicit val isoCodec = scala.io.Codec("ISO-8859-1")
val stopwords = Set("the","a","an","of","in","for","by","on")
val trainDir = new File(newsgroupsDir, "20news-bydate-train")
val trainingExamples = fromLabeledDirs(trainDir).toList
val config = LiblinearConfig(cost=5.0)
val featurizer = new BowFeaturizer(stopwords)
val classifier = trainClassifier(config, featurizer, trainingExamples)
val evalDir = new File(newsgroupsDir, "20news-bydate-test")
val maxLabelNews = maxLabel(classifier.labels) _
val comparisons = for (ex <- fromLabeledDirs(evalDir).toList) yield
(ex.label, maxLabelNews(classifier.evalRaw(ex.features)), ex.features)
val (goldLabels, predictions, inputs) = comparisons.unzip3
println(ConfusionMatrix(goldLabels, predictions, inputs))
} Questions or suggestions?Post a message to the scalanlp-discuss mailing list or create an issue. |
2023-10-27
2022-08-15
2022-08-17
2022-09-23
2022-08-13
请发表评论