How to Train a Machine Learning Model Without Code
What Happens When You Train a Model
Training is the process where the algorithm examines your historical data and learns the relationships between input columns and the outcome column. If you are predicting whether a customer will churn, the algorithm figures out that customers with low engagement scores and no recent purchases are more likely to leave. It captures those patterns as mathematical weights stored in the model.
Behind the scenes, the platform splits your data into a training set (typically 80%) and a validation set (20%). The model learns from the training set, then its predictions are checked against the validation set to measure real-world accuracy. This prevents overfitting, where a model memorizes the training data but fails on new examples.
Step-by-Step Training Process
Your data needs to be in a CSV file with column headers in the first row. Each row after that is one data point. One column is your target (what you want to predict). The rest are input features the model uses to make predictions. See the data preparation guide for formatting tips.
In the Data Aggregator app, upload your CSV file. The platform scans the columns, detects data types (numeric, categorical, date), and shows a preview so you can verify the data looks correct before proceeding.
Choose the type of model based on what you want to predict. Use a classifier if the outcome is a category (yes/no, churned/retained, approved/denied). Use a regressor if the outcome is a number (sales amount, visitor count, price). Use a clusterer if you want to group similar items without a target column. The algorithm selection guide covers all 18 available algorithms.
Set the target column and any algorithm-specific parameters. Most defaults work well for initial training. Click train and the platform runs the job. Training time depends on your dataset size, from seconds for small datasets to a few minutes for larger ones.
After training completes, the platform reports accuracy metrics. For classifiers, you see accuracy percentage, precision, and recall. For regressors, you see R-squared and mean absolute error. Use the accuracy testing guide to understand what these numbers mean and whether your model is ready for production use.
Upload new data (same columns as training, minus the target column) and get predictions. Each row receives a predicted value or category. After initial training, predictions are free with zero per-request cost.
Choosing Between Algorithm Types
The platform offers 18 algorithm types across four categories. Here is a quick decision framework:
- Classification (Random Forest, Gradient Boosting, SVM, Logistic Regression, and more): Use when predicting categories. Will this customer churn? Is this transaction fraudulent? Will this lead convert?
- Regression (Linear Regression, Gradient Boosting Regressor, Random Forest Regressor, and more): Use when predicting numbers. How many visitors next week? What price should this product be? How many support tickets tomorrow?
- Clustering (K-Means, DBSCAN, and more): Use when grouping similar items without a predefined outcome. Segment customers, group products, find natural categories in your data.
- Anomaly Detection (Isolation Forest, One-Class SVM): Use when finding outliers. Detect fraud, unusual server activity, or data quality issues.
Tips for Better Results on Your First Model
- Start with more data. A model trained on 100 rows will not be as accurate as one trained on 10,000 rows. If your dataset is small, focus on the most predictive columns rather than adding more features.
- Clean your data before uploading. Remove rows with missing values in the target column. Fill in missing input values with reasonable defaults or remove those rows entirely.
- Try multiple algorithms. Train two or three different algorithms on the same data and compare accuracy. The best algorithm varies by dataset, and testing takes only a few minutes.
- Do not expect perfection. A model that correctly predicts 80% of outcomes is genuinely useful. It does not need to be 99% to save time and improve decisions.
After Training: What Comes Next
Once your model is trained, you can chain it into a pipeline with other models, retrain it periodically as new data arrives, or connect predictions to automated workflows that take action based on model output. The model stays available in your account for predictions at any time.
Train your first machine learning model in minutes. Upload data, pick an algorithm, and start predicting.
Get Started Free