Tuwaiq Academy

Math Score Prediction

Machine Learning with Linear Regression

Group 1 โ€ข AI Course 2025

The Challenge

Can we predict student success?

Educational institutions need data-driven insights to support student achievement.

๐ŸŽฏ
Predict
Math scores from data
๐Ÿ“Š
Identify
Success factors
๐Ÿš€
Enable
Early intervention

Why This Matters

Resource Allocation: Better distribute educational resources
Personalized Learning: Tailor approaches to students
Academic Success: Improve overall outcomes

Our Dataset

Clean Data, Clear Insights

1,000 Students
8 Features
100% Complete
3 Key Scores
๐Ÿ“š
Academic Scores
Reading, Writing, Math
๐Ÿ‘ฅ
Demographics
Gender, Race/Ethnicity
๐ŸŽ“
Parent Education
6 education levels

๐Ÿ” Key Discovery

Academic skills are deeply interconnected!

82%
Math โ†” Reading
80%
Math โ†” Writing
95%
Reading โ†” Writing
+5.8
Test Prep Boost

The Model

Linear Regression

Simple. Powerful. Interpretable.

from sklearn.linear_model import LinearRegression
# Train the model
model = LinearRegression()
model.fit(X_train, y_train)
# Make predictions
predictions = model.predict(X_test)
accuracy = model.score(X_test, y_test)

Why Linear Regression?

Interpretability

Clear feature impact

Transparency

No black box

Performance

Fast & efficient

Reliability

Proven baseline

๐ŸŽฏ Outstanding Results

87.6%
Model Accuracy (Rยฒ)
ยฑ4.21
Average Error
92.4%
Within ยฑ5 points
98.7%
Within ยฑ10 points

Feature Impact on Math Scores

0.782 Reading
0.654 Writing
+3.2 Test Prep
+2.8 Parent Ed

What Matters Most?

Reading is King: Strongest predictor
Writing Matters: Second most important
Test Prep Helps: But less than expected
Background: Modest impact

Real-World Applications

๐ŸŽ“ For Educators

Focus on comprehensive literacy - it drives all academic success

๐Ÿ“š For Students

Improving reading and writing will boost your math performance

๐Ÿซ For Schools

Data-driven insights for resource allocation and intervention

Challenges & Solutions

Categorical Variables

โœ“ One-hot encoding

Feature Selection

โœ“ Correlation analysis

Multicollinearity

โœ“ Linear regression handles it

Interpretability

โœ“ Coefficient analysis

๐ŸŽฏ Key: Academic skills are interconnected!

Future Work

๐Ÿ”ง
Advanced Models
Ridge/Lasso, XGBoost
โšก
Feature Engineering
Interaction features
๐ŸŽฏ
Cross-Validation
K-fold evaluation
๐Ÿš€
Deployment
Web app predictions
๐Ÿ“Š
Ensemble Methods
Combine multiple models
๐Ÿงช
A/B Testing
Real-world validation

Thank You!

Questions?

โœ“ Mission Complete
87.6% Model Accuracy
๐Ÿš€ Ready to Deploy