Ytk-learn is a distributed machine learning library which implements most of popular machine learning algorithms. It runs on single, multiple machines and major distributed environments(hadoop, spark)๏ผŒand supports major operating systems(Linux, Windows, Mac OS)๏ผŒthe communication of distributed environments is implemented based on ytk-mp4j which is pure java, mpi-like message passing interface.

Features

  • Supports most of operating systems: Linux, Mac OS, Windows
  • Supports various platforms: single machine, common cluster, hadoop, spark
  • Supports local file system and hdfs file system
  • Provides uniform file system interface and can be applied to other file systems easily.
  • Provides user friendly codes for online prediction.
  • Without complex installation, only needs Java SE Runtime Environment 8 installation.

For more details, refer to features

Documents

Experiments

We compare our GBDT with XGBoost and LightGBM, see gbdt experiments for more details.

Environment Requirements

To run or develop ytk-learn๏ผŒjust install JRE 8 or JDK 8 and set JAVA_HOME.

Ytk Learn

Ytk-learn is a distributed machine learning library which implements most of popular machine learning algorithms(GBDT, GBRT, Mixture Logistic Regression, Gradient Boosting Soft Tree, Factorization Machines, Field-aware Factorization Machines, Logistic Regression, Softmax).

Ytk Learn Info

โญ Stars347
๐Ÿ”— Source Codegithub.com
๐Ÿ•’ Last Updatea year ago
๐Ÿ•’ Created6 years ago
๐Ÿž Open Issues1
โž— Star-Issue Ratio347
๐Ÿ˜Ž Authoryuantiku