How do we calculate the Euclidean distance between two data points using basic Python operations?

by EITCA Academy / Monday, 07 August 2023 / Published in Artificial Intelligence, EITC/AI/MLP Machine Learning with Python, Programming machine learning, Programming own K nearest neighbors algorithm, Examination review

To calculate the Euclidean distance between two data points using basic Python operations, we need to understand the concept of Euclidean distance and then implement it using Python.

Euclidean distance is a measure of the straight-line distance between two points in a multidimensional space. It is commonly used in machine learning algorithms, such as the k-nearest neighbors (KNN) algorithm, to determine the similarity or dissimilarity between data points.

The Euclidean distance between two points (x1, y1) and (x2, y2) in a two-dimensional space can be calculated using the following formula:

distance = sqrt((x2 – x1)^2 + (y2 – y1)^2)

To generalize this formula for n-dimensional space, we can use the following formula:

distance = sqrt((x2 – x1)^2 + (y2 – y1)^2 + … + (zn – z1)^2)

Now, let's implement this formula in Python. We can define a function called euclidean_distance that takes two data points as input and returns the Euclidean distance between them.

python
import math

def euclidean_distance(point1, point2):
    distance = 0.0
    for i in range(len(point1)):
        distance += (point2[i] - point1[i]) ** 2
    return math.sqrt(distance)

In this code, we first import the math module to use the square root function. Then, we define the euclidean_distance function that takes two points as input: point1 and point2. The function initializes the distance variable to 0.0.

Next, we iterate over the dimensions of the points using a for loop. For each dimension, we calculate the squared difference between the corresponding coordinates of the two points and add it to the distance variable.

Finally, we return the square root of the distance, which gives us the Euclidean distance between the two points.

Let's see an example to understand how to use this function:

python
point1 = [1, 2, 3]
point2 = [4, 5, 6]
distance = euclidean_distance(point1, point2)
print(distance)

Output:

5.196152422706632

In this example, we have two points: point1 with coordinates [1, 2, 3] and point2 with coordinates [4, 5, 6]. We pass these points to the euclidean_distance function, which calculates the Euclidean distance between them. The output is approximately 5.196152422706632.

To summarize, the Euclidean distance between two data points can be calculated using the formula sqrt((x2 – x1)^2 + (y2 – y1)^2 + … + (zn – z1)^2). We can implement this formula in Python using a function that takes two points as input and returns the Euclidean distance. The function iterates over the dimensions of the points, calculates the squared differences, sums them up, takes the square root of the sum, and returns the result.

EITCA Academy

How do we calculate the Euclidean distance between two data points using basic Python operations?

Other recent questions and answers regarding Examination review:

More questions and answers:

EITCA Academy is a part of the European IT Certification framework

EITCA Academy

LOG IN TO YOUR ACCOUNT

FORGOT YOUR PASSWORD?

CREATE AN ACCOUNT

How do we calculate the Euclidean distance between two data points using basic Python operations?

Other recent questions and answers regarding Examination review:

More questions and answers: