Machine Learning Libraries
Light Gradient Boosting Machine
★ 4.7
Machine Learning in Python
★ 4.9
pip install lightgbmpip install scikit-learnpip install lightgbmpip install scikit-learnLightGBM is the standard gradient boosting library for tabular ML tasks in Python. Data engineers use it to train classification and regression models on structured pipeline outputs — it handles missing values natively, trains significantly faster than XGBoost on large datasets, and integrates directly with pandas DataFrames and scikit-learn Pipelines.
Data engineers use scikit-learn Pipelines to build reproducible feature engineering and model training workflows. A `Pipeline` chains `StandardScaler`, `OneHotEncoder`, and a classifier — ensuring the same transformations apply at both training and inference time, preventing data leakage and making model serving straightforward.
Individual Tool Pages