How to Score Leads With Machine Learning
Why ML Lead Scoring Beats Manual Scoring
Manual lead scoring assigns points based on rules you write: +10 for visiting the pricing page, +5 for downloading a whitepaper, +20 for being in the right industry. These rules reflect your assumptions about what makes a good lead, but assumptions are often wrong or incomplete.
ML lead scoring learns from outcomes. It examines hundreds or thousands of past leads where you know who eventually bought and who did not. The model discovers which features actually predict conversion, including combinations of features that no human would think to check. Maybe leads who visit the pricing page AND come from LinkedIn AND are in companies with 10-50 employees convert at 5x the average rate. The model finds these patterns automatically.
What Data You Need
Export your historical lead data with a "converted" column (yes/no or 1/0) as the target. Include every feature you have about each lead:
- Source: Where the lead came from (organic search, paid ad, referral, social media, event)
- Demographics: Industry, company size, job title, geographic location
- Behavior: Pages viewed, time on site, content downloaded, emails opened, forms submitted
- Engagement: Number of visits, days since first visit, return visit frequency
- Qualification data: Budget range, timeline, specific needs expressed (if captured by forms or chatbot conversations)
- Timing: Day of week and time of day they first engaged, speed of follow-up response
You need at least 200-500 leads with known outcomes, and a reasonable balance between converted and non-converted. If only 2% of your leads convert, you may need thousands of leads to have enough conversion examples. See How Much Data Do You Need.
Building the Lead Scoring Model
Pull leads from the past 6-12 months where the outcome is known. Include both leads that converted and leads that did not. Remove leads that are still in the pipeline with no outcome yet because they would confuse the model.
Standardize source names, fill in missing values, and remove any columns that would not be available at the time of scoring (like "final deal size," which you only know after conversion). See How to Prepare Your Data.
Upload to the Data Aggregator app and choose logistic regression or random forest. Logistic regression is especially good for lead scoring because it returns probability scores (73% likely to convert) rather than just yes/no labels. Random forest handles complex feature interactions better if you have many features.
Check which features the model found most predictive. This is valuable business intelligence on its own. You might discover that leads from a particular source or industry convert at much higher rates than you realized. Use this information to refine your marketing targeting. See How to Test Model Accuracy.
Set up a process to score each new lead as it arrives. Send the lead's data through the model and get back a conversion probability. Leads scoring above 70% go to the top of the call queue. Leads scoring 30-70% get added to a nurture drip campaign. Leads below 30% receive automated follow-up only.
Turning Scores Into Actions
A lead score is only valuable if it changes what you do. Design clear action tiers based on score ranges:
- Hot leads (70-100%): Immediate sales outreach within the hour. These leads have the highest conversion probability and should get your best sales rep's attention first.
- Warm leads (40-70%): Same-day follow-up with a personalized email or SMS. Add to an automated nurture sequence that educates and builds trust over 1-2 weeks.
- Cool leads (15-40%): Add to a longer drip campaign with educational content. Re-score after they engage with your content to see if their score improves.
- Cold leads (0-15%): Automated email sequence only. Do not spend sales team time on these unless they re-engage and their score increases.
Automate this routing with workflow automation. A chain command can score the lead, check the tier, and trigger the appropriate follow-up action without any manual intervention.
Combining ML Scoring With AI Chatbot Qualification
For maximum lead intelligence, combine ML scoring with AI chatbot qualification. The chatbot asks qualifying questions during the conversation (budget, timeline, specific needs) and captures structured data. Feed this data into the ML model alongside behavioral features for an even more accurate lead score. The chatbot handles the conversation, the ML model handles the math.
Score every lead automatically and send your sales team to the hottest prospects first. Train a lead scoring model on your own data.
Get Started Free