[Design] DFS, BFS and space efficiency | Mar2ndx's Algorithmic and System Design Blog

First Word

This post talks about BFS, DFS and space efficiency.

Analysis

BFS is the easy case. It is implemented with a queue, and its space usage is O(width), where width is the largest number of nodes on any one level.
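As a rough illustration of the queue-based version, here is a minimal sketch that also records the largest queue length seen. The function name `bfs_peak_queue`, the adjacency-dict representation and the sample tree are assumptions made for this example, not something taken from the post.

```python
from collections import deque

def bfs_peak_queue(graph, start):
    """Return (visit order, peak queue length) for a BFS from `start`.

    `graph` is assumed to be an adjacency dict {vertex: [neighbors]}.
    """
    visited = {start}
    queue = deque([start])
    order, peak = [], 1
    while queue:
        v = queue.popleft()
        order.append(v)
        for w in graph[v]:
            if w not in visited:
                visited.add(w)      # mark on enqueue so nothing is queued twice
                queue.append(w)
        peak = max(peak, len(queue))
    return order, peak

# Perfect binary tree with 3 levels (7 nodes); the bottom level has 4 nodes.
tree = {1: [2, 3], 2: [4, 5], 3: [6, 7], 4: [], 5: [], 6: [], 7: []}
order, peak = bfs_peak_queue(tree, 1)
print(order, peak)
```

On this small tree the queue peaks at 4, which is the width of the bottom level.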

For DFS, there are 2 types: true DFS and pseudo-DFS. True DFS space usage is O(depth), while pseudo-DFS is O(depth × branching factor).

The branching factor is the number of children at each node, i.e. its outdegree.

Pseudo-DFS is implemented by simply taking the BFS implementation and replacing the queue with a stack. True DFS is a very different algorithm, in which the stack is used for backtracking only. This is a great article on the topic.

the true classic DFS cannot be obtained from BFS by such queue-to-stack replacement. The classic DFS is a completely different algorithm with significantly different inner structure. True DFS is a genuinely recursive algorithm that uses stack for backtracking purposes, not for storing the vertex discovery “front” (as is the case in BFS)
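To make the structural difference concrete, here is a minimal sketch of both variants. The names `pseudo_dfs` and `true_dfs` and the sample graph are illustrative assumptions. The true DFS is written iteratively with a stack of neighbor iterators, which is one common way to mimic the recursive algorithm and not necessarily how the quoted article would write it.

```python
def pseudo_dfs(graph, start):
    """BFS with the queue swapped for a stack.

    The stack holds the discovery front: every discovered vertex
    that has not been processed yet.
    """
    visited = {start}
    stack = [start]
    order = []
    while stack:
        v = stack.pop()
        order.append(v)
        for w in graph[v]:
            if w not in visited:
                visited.add(w)      # marked when pushed, exactly as in BFS
                stack.append(w)
    return order

def true_dfs(graph, start):
    """Classic DFS with an explicit stack used only for backtracking.

    Each stack entry is a partially consumed neighbor iterator for one
    vertex on the current path, so the stack is never deeper than the path.
    """
    visited = {start}
    order = [start]
    stack = [iter(graph[start])]
    while stack:
        for w in stack[-1]:
            if w not in visited:
                visited.add(w)
                order.append(w)
                stack.append(iter(graph[w]))   # descend into w
                break
        else:
            stack.pop()                        # neighbors exhausted: backtrack
    return order

graph = {"a": ["b", "c"], "b": ["d"], "c": ["d"], "d": []}
print(pseudo_dfs(graph, "a"))  # ['a', 'c', 'd', 'b']
print(true_dfs(graph, "a"))    # ['a', 'b', 'd', 'c']
```

In `pseudo_dfs`, all unvisited neighbors of a vertex are pushed at once. In `true_dfs`, only one neighbor is entered at a time and the iterator remembers where to resume after backtracking.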

The following quotes are also worth reading.

Something worth noting - Pseudo-DFS should give you O(depth * branching factor) space, as opposed to O(depth) for proper DFS, which is still better than O(width).

The defining characteristic that separates true DFS and what you call pseudo DFS is how they use the stack. True DFS use it to store backtracking information in contrast to pseudo DFS which used the stack to store the vertex discovery front.

Examples

Take a perfect binary search tree with 4 levels. BFS needs space for 8 nodes (the width of the bottom level), while DFS needs space for only 4 (the depth). The branching factor is 2, so true DFS and pseudo-DFS use the same amount of space.
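The numbers for this tree can be worked out as follows. This is a sketch under stated assumptions: $b$ is the branching factor, $d$ is the number of levels, "space" means the peak number of entries in the queue or stack, true DFS keeps one entry per node on the current path, and pseudo-DFS marks nodes when they are pushed.

```latex
\begin{align*}
\text{BFS queue peak}        &= b^{\,d-1} = 2^{3} = 8 \\
\text{true DFS stack peak}   &= d = 4 \\
\text{pseudo-DFS stack peak} &= 1 + (d-1)(b-1) = 1 + 3 \cdot 1 = 4
\end{align*}
```

With $b = 2$ the pseudo-DFS expression collapses to $d$, which is why the two DFS variants tie on a binary tree. For larger $b$ the pseudo-DFS peak grows with $b$ while the true DFS peak does not.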

Take a star graph: a single central vertex surrounded by 1000 peripheral vertices, each connected to the central vertex. If we run BFS on this graph from the center, the queue size immediately jumps to 1000. The same thing happens with pseudo-DFS. But the classic DFS algorithm needs a stack depth of only 1 (!) to traverse the entire graph.
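The star graph case can be checked with a short experiment. The sketch below is only an illustration: the three `*_peak` helpers are hypothetical names, each returning the largest queue or stack length seen, and the traversal is assumed to start from the central vertex (numbered 0 here).

```python
from collections import deque

def bfs_peak(graph, start):
    visited, queue, peak = {start}, deque([start]), 1
    while queue:
        v = queue.popleft()
        for w in graph[v]:
            if w not in visited:
                visited.add(w)
                queue.append(w)
        peak = max(peak, len(queue))
    return peak

def pseudo_dfs_peak(graph, start):
    # Same loop as BFS, with a stack instead of a queue.
    visited, stack, peak = {start}, [start], 1
    while stack:
        v = stack.pop()
        for w in graph[v]:
            if w not in visited:
                visited.add(w)
                stack.append(w)
        peak = max(peak, len(stack))
    return peak

def true_dfs_peak(graph, start):
    # Stack of neighbor iterators: one entry per vertex on the current path.
    visited, stack, peak = {start}, [iter(graph[start])], 1
    while stack:
        for w in stack[-1]:
            if w not in visited:
                visited.add(w)
                stack.append(iter(graph[w]))
                peak = max(peak, len(stack))
                break
        else:
            stack.pop()   # backtrack
    return peak

n = 1000
star = {0: list(range(1, n + 1))}                      # center -> all leaves
star.update({leaf: [0] for leaf in range(1, n + 1)})   # each leaf -> center

print(bfs_peak(star, 0), pseudo_dfs_peak(star, 0), true_dfs_peak(star, 0))
# 1000 1000 2
```

The true DFS figure is 2 rather than 1 because this sketch counts the entry for the center as well as the one leaf currently being explored. Either way it stays constant as the number of leaves grows, while the other two grow linearly.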