Spanning Tree

Overview

A spanning tree is a sub-graph, that contains all the vertices of a graph. A Spanning tree may or may not be weighted, a spanning tree does not have cycles and it cannot be disconnected. The Spanning tree has a minimal set of edges. A single connected graph can have multiple spanning trees.

Scope

In this article, we are going to learn about the spanning trees and their interesting properties. We will also learn about the minimum spanning tree, its properties, and algorithms to construct the minimum spanning tree with complexity analysis of the minimum spanning tree algorithm. We will discuss various real-life applications of the spanning tree as well as the minimum spanning tree (MST). We will talk about what is clustering and how it can be achieved.

What is Spanning Tree?

A spanning tree is a sub-graph that connects all the vertices of a graph with the minimum possible number of edges. It may or may not be weighted and does not have cycles.
Let us understand what is spanning tree with an interesting example.

Consider a situation where a cable television network company is laying the cable to a new neighborhood. If the company is constrained to bury the cable only along certain paths for example along the roads then the problem is to find the minimum amount of cable that the company requires to complete the wiring of the television network

The above problem can be solved by representing the cable T.V. network as a graph whose points are connected by those paths. Some of those paths might be more expensive because they are longer and require more amount of cable to be buried. These paths would be represented by the edges of a graph with larger weights.

A spanning tree for that graph would be a subset of those paths that has no cycles but still connects to every house. There might be several spanning trees possible. A minimum spanning tree would be one with the lowest total cost and thus would represent the least expensive path for laying the cable.

Spanning Trees of a given graph G can also be defined as a minimal set of edges that contains all the vertices of G. A spanning tree does not have any cycle and it can never be disconnected. A spanning tree can be weighted or unweighted.

Example of Spanning Tree

A complete undirected graph G can have a maximum n^n-2 number of spanning trees, where n is the number of nodes in a given graph G. Let us Consider a complete graph G with 3 vertices, then the total number of spanning trees this graph can have is 3^(3-2)=3 which are shown in the image below.

Example of Spanning Tree

In the above picture, we can see that the tree have no cycles and they are minimally connected so they are all the possible spanning trees of 3 vertices for a given graph G.

General Properties of Spanning Tree

Let us discuss some properties of a spanning tree.

All possible spanning trees for graph G have the same number of edges and vertices.
Spanning trees do not have any cycles.
A Spanning tree is a minimally connected sub-graph, which means if we remove any edge from the spanning tree then it becomes disconnected.
A Spanning tree is a maximally acyclic sub-graph, which means if we add an edge to the spanning tree then it becomes cyclic.
A connected graph G can have more than one spanning tree.

Mathematical Properties of Spanning Tree

Let us see some mathematical properties of a spanning tree.

A Spanning tree always contains n-1 edges, where n is the total number of vertices in the graph G.
The total number of spanning trees that a complete graph of n vertices can have is n^(n-2).
We can construct a spanning tree by removing atmost e-n+1 edges from a complete graph G, where e is the number of edges and n is the number of vertices in graph G.

Clustering

Let us understand clustering with an interesting example.

Consider a situation where a businessman is trying to get the best return on his marketing investment, in such a case it is crucial that he must target people in the right way. Clustering algorithms are able to group together people with similar traits and likelihood to purchase. Once he has the groups, he can run tests on each group with different marketing copy that will help him better target his customers.

Clustering is the process of grouping similar objects together. Clustering is one of the most famous applications of the spanning tree. In clustering, our goal is to divide n objects into k different groups such that the different groups get placed at maximum distance from one another.

Clustering is the task of dividing the data points into a number of groups such that data points in the same groups are more similar than those in other data points in the same group than those in other groups. In simple words, the aim is to segregate groups with similar traits and assign them into clusters.

Clustring

In the above picture, we can see that the objects are divided into different clusters. Every cluster is represented by a different colour.

How Can Clustering Be Achieved?

Let us take an example to understand how we can achieve clustering.

Suppose we have 12 objects and we have to divide them into 3 groups. So n=12 and k=3, where n is the number of objects and k is the number of groups in which we have to divide the n objects.

Firstly we divide the n objects into k groups. Here n=12 and k=3.

achieve clustering example

Here every group is represented by a different color.

Now we combine these clusters iteratively by adding an edge between them.

achieve clustring step 2

We will stop after we reach the k clusters.

achieve clustring step 3

What is Minimum Spanning Tree?

Let us learn about the Minimum Spanning Tree with an interesting real-life example.

Consider a Parcel delivery Agency which delivers the parcels over the city. The company has delivery agents who deliver the parcel to different parts of the city. The goal of the company is to deliver all the parcels with the minimum cost of transportation and in the minimum amount of time. To achieve this goal company has to decide on an optimal path for every delivery agent such that it requires minimum time and fuel for delivering all the parcels.

The above problem can be solved by representing the route in form of a graph, whose vertices represent the destination of delivery, and the path to reach the destination is represented by the edges of the graph. We can have multiple spanning trees that represent the various routes of delivery In such cases a minimum spanning tree will be one by which we can reach the destination in the minimum time and cost which serves the purpose of the company.

A minimum spanning tree is a spanning tree that has a minimum cost among all the spanning trees. The cost of the spanning tree is the sum of the weights of all the edges in the tree. In real-life situations, this weight can be measured as distance, cost of transportation, manufacturing cost, traffic load, or any arbitrary value denoted by the edges. A minimum spanning tree has (V – 1) edges where V is the number of vertices in the given graph.

For a given graph G a minimum spanning tree of a graph is unique if the weight of all the edges is distinct. Otherwise, there may be multiple possible minimum spanning trees. Minimum Spanning tree can also be written as MST in short form.

Example of Minimum Spanning Tree

Let us understand the Minimum Spanning Tree with the help of the example below.

Consider a weighted graph G with three vertices as shown in the picture below. minimum spanning example 1

Now let us see some of the spanning trees which are possible with this graph G.

1. minimum spanning example 2

Total Cost=4+5=9

2. minimum spanning example 3

Total Cost=4+7=11

3. minimum spanning example 4 Total Cost=7+5=12

From the above three cases, we can see that among all possible spanning trees figure 1 has the minimum cost, So it is the minimum spanning tree among the given spanning trees.

Minimum Spanning Tree Algorithm

Let us study the minimum spanning tree algorithm. So there are two famous algorithms for finding the Minimum Spanning Tree: Prim's and Kruskal's Algorithm

Kruskal’s Algorithm

In Kruskal’s Algorithm, the spanning tree is constructed by adding the edges one by one. Kruskal’s Algorithm is based on the greedy approach because every time we add that edge that has the least weight among all available options.

Algorithm Steps:

Sort all the edges of the graph in the increasing order of their weight.
Pick the edge with the smallest weight.
Check if it forms a cycle with the spanning tree formed so far.
- Include the current edge if it does not form any cycle.
- Otherwise discard it.
Repeat step #3 until there V-1 edges in the spanning tree, where V is the total number of vertices in the graph.

Example

Let us understand the working of Kruskal's Algorithm with an example. Consider a weighted graph G having seven vertices in the picture below.

Kruskal’s Algorithm