Building a binary tree problem¶

Question

Given the pre-order traversal preorder sequence and the in-order traversal inorder sequence of a binary tree, construct the binary tree and return its root node. Assume there are no duplicate node values in the binary tree (as shown in the figure below).

Determining if it is a divide-and-conquer problem¶

The original problem of building a binary tree from the preorder and the inorder sequences is a typical divide-and-conquer problem.

The problem can be decomposed: From the perspective of divide-and-conquer, we can divide the original problem into two subproblems—building the left subtree and building the right subtree—plus one operation of initializing the root node. For each subtree (subproblem), we continue applying the same approach, partitioning it into smaller subtrees (subproblems), until reaching the smallest subproblem (an empty subtree).
The subproblems are independent: The left and right subtrees do not overlap. When building the left subtree, we only need the segments of the in-order and pre-order traversals that correspond to the left subtree. The same approach applies to the right subtree.
Solutions to subproblems can be combined: Once we have constructed the left and right subtrees (the subproblem solutions), we can attach them to the root node to obtain the solution to the original problem.

How to divide the subtrees¶

Based on the above analysis, this problem can be solved using divide-and-conquer. However, how do we use the pre-order traversal preorder sequence and the in-order traversal inorder sequence to divide the left and right subtrees?

By definition, both the preorder and inorder sequences can be divided into three parts:

Pre-order traversal: [ Root | Left Subtree | Right Subtree ]. For example, in the figure, the tree corresponds to [ 3 | 9 | 2 1 7 ].
In-order traversal: [ Left Subtree | Root | Right Subtree ]. For example, in the figure, the tree corresponds to [ 9 | 3 | 1 2 7 ].

Using the data from the preceding figure, we can follow the steps shown in the next figure to obtain the division results:

The first element 3 in the pre-order traversal is the value of the root node.
Find the index of the root node 3 in the inorder sequence, and use this index to split inorder into [ 9 | 3 ｜ 1 2 7 ].
According to the split of the inorder sequence, it is straightforward to determine that the left and right subtrees contain 1 and 3 nodes, respectively, so we can split the preorder sequence into [ 3 | 9 | 2 1 7 ] accordingly.

Describing subtree ranges based on variables¶

Based on the above division method, we have now obtained the index ranges of the root, left subtree, and right subtree in the preorder and inorder sequences. To describe these index ranges, we use several pointer variables.

Let the index of the current tree's root node in the preorder sequence be denoted as \(i\).
Let the index of the current tree's root node in the inorder sequence be denoted as \(m\).
Let the index range of the current tree in the inorder sequence be denoted as \([l, r]\).

As shown in the table below, these variables represent the root node’s index in the preorder sequence and the index ranges of the subtrees in the inorder sequence.

Table Indexes of the root node and subtrees in pre-order and in-order traversals

	Root node index in `preorder`	Subtree index range in `inorder`
Current tree	\(i\)	\([l, r]\)
Left subtree	\(i + 1\)	\([l, m-1]\)
Right subtree	\(i + 1 + (m - l)\)	\([m+1, r]\)

Please note that \((m-l)\) in the right subtree root index represents "the number of nodes in the left subtree." It may help to consult the figure below for a clearer understanding.

Code implementation¶

To improve the efficiency of querying \(m\), we use a hash table hmap to store the mapping from elements in the inorder sequence to their indexes:

[file]{build_tree}-[class]{}-[func]{build_tree}

The figure below shows the recursive process of building the binary tree. Each node is created during the "descending" phase of the recursion, and each edge (reference) is formed during the "ascending" phase.

Each recursive function's division of the preorder and inorder sequences is illustrated in the figure below.

Assuming the binary tree has \(n\) nodes, initializing each node (calling the recursive function dfs()) takes \(O(1)\) time. Therefore, the overall time complexity is \(O(n)\).

Because the hash table stores the mapping from inorder elements to their indexes, it requires \(O(n)\) space. In the worst case, if the binary tree degenerates into a linked list, the recursive depth can reach \(n\), consuming \(O(n)\) stack space. Hence, the overall space complexity is \(O(n)\).