How to Do a Machine Learning Project

- July 05, 2025

How to Do a Machine Learning Project Step-by-Step (with Explanation and Code)

In this tutorial, you'll learn how to execute a complete Machine Learning project using Python. We’ll use the popular Iris dataset and the scikit-learn library. This guide is perfect for beginners who want to understand the typical ML workflow.

Tools Required: Python, Jupyter Notebook or VS Code, scikit-learn, pandas, matplotlib.

Step 1: Understand the Problem

The Iris dataset contains measurements of flowers from three species. Your goal is to classify them into the correct species based on sepal and petal measurements.

Step 2: Import Required Libraries

import pandas as pd
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score, classification_report
import matplotlib.pyplot as plt
import seaborn as sns

Step 3: Load and Explore the Dataset

We'll load the dataset and view its structure:

iris = load_iris()
df = pd.DataFrame(iris.data, columns=iris.feature_names)
df['target'] = iris.target
print(df.head())

Let's visualize the relationships:

sns.pairplot(df, hue="target")
plt.show()

Step 4: Prepare the Data

Split the data into features and labels, then into training and test sets:

X = df.drop('target', axis=1)
y = df['target']

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

Step 5: Train a Model

We'll use a Random Forest Classifier, a simple but powerful ensemble method:

model = RandomForestClassifier(n_estimators=100)
model.fit(X_train, y_train)

Step 6: Make Predictions and Evaluate

Use the trained model to predict and evaluate its performance:

y_pred = model.predict(X_test)

print("Accuracy:", accuracy_score(y_test, y_pred))
print(classification_report(y_test, y_pred))

Step 7: Make a Single Prediction

Let’s try predicting the class for a single flower:

sample = [[5.1, 3.5, 1.4, 0.2]]
prediction = model.predict(sample)
print("Predicted Species:", iris.target_names[prediction[0]])

Step 8: Save and Reload the Model (Optional)

import joblib

# Save model
joblib.dump(model, 'iris_model.pkl')

# Load model
# model = joblib.load('iris_model.pkl')

Conclusion

You just completed your first Machine Learning project! You learned how to load data, visualize it, train a model, and make predictions. Try using different datasets and models to deepen your learning.

What's Next?

Try using other models like LogisticRegression or KNeighborsClassifier.
Practice on different datasets (e.g., Titanic, Wine Quality, MNIST).
Deploy your model using Flask or Streamlit.

Search This Blog

My Teaching Desk - By LokeshDev