Building a binary tree problem¶
Question
Given the pre-order traversal preorder
sequence and the in-order traversal inorder
sequence of a binary tree, construct the binary tree and return its root node. Assume there are no duplicate node values in the binary tree (as shown in the figure below).
Determining if it is a divide-and-conquer problem¶
The original problem of building a binary tree from the preorder
and the inorder
sequences is a typical divide-and-conquer problem.
- The problem can be decomposed: From the perspective of divide-and-conquer, we can divide the original problem into two subproblems—building the left subtree and building the right subtree—plus one operation of initializing the root node. For each subtree (subproblem), we continue applying the same approach, partitioning it into smaller subtrees (subproblems), until reaching the smallest subproblem (an empty subtree).
- The subproblems are independent: The left and right subtrees do not overlap. When building the left subtree, we only need the segments of the in-order and pre-order traversals that correspond to the left subtree. The same approach applies to the right subtree.
- Solutions to subproblems can be combined: Once we have constructed the left and right subtrees (the subproblem solutions), we can attach them to the root node to obtain the solution to the original problem.
How to divide the subtrees¶
Based on the above analysis, this problem can be solved using divide-and-conquer. However, how do we use the pre-order traversal preorder
sequence and the in-order traversal inorder
sequence to divide the left and right subtrees?
By definition, both the preorder
and inorder
sequences can be divided into three parts:
- Pre-order traversal:
[ Root | Left Subtree | Right Subtree ]
. For example, in the figure, the tree corresponds to[ 3 | 9 | 2 1 7 ]
. - In-order traversal:
[ Left Subtree | Root | Right Subtree ]
. For example, in the figure, the tree corresponds to[ 9 | 3 | 1 2 7 ]
.
Using the data from the preceding figure, we can follow the steps shown in the next figure to obtain the division results:
- The first element 3 in the pre-order traversal is the value of the root node.
- Find the index of the root node 3 in the
inorder
sequence, and use this index to splitinorder
into[ 9 | 3 | 1 2 7 ]
. - According to the split of the
inorder
sequence, it is straightforward to determine that the left and right subtrees contain 1 and 3 nodes, respectively, so we can split thepreorder
sequence into[ 3 | 9 | 2 1 7 ]
accordingly.
Describing subtree ranges based on variables¶
Based on the above division method, we have now obtained the index ranges of the root, left subtree, and right subtree in the preorder
and inorder
sequences. To describe these index ranges, we use several pointer variables.
- Let the index of the current tree's root node in the
preorder
sequence be denoted as \(i\). - Let the index of the current tree's root node in the
inorder
sequence be denoted as \(m\). - Let the index range of the current tree in the
inorder
sequence be denoted as \([l, r]\).
As shown in the table below, these variables represent the root node’s index in the preorder
sequence and the index ranges of the subtrees in the inorder
sequence.
Table
Root node index in preorder |
Subtree index range in inorder |
|
---|---|---|
Current tree | \(i\) | \([l, r]\) |
Left subtree | \(i + 1\) | \([l, m-1]\) |
Right subtree | \(i + 1 + (m - l)\) | \([m+1, r]\) |
Please note that \((m-l)\) in the right subtree root index represents "the number of nodes in the left subtree." It may help to consult the figure below for a clearer understanding.
Code implementation¶
To improve the efficiency of querying \(m\), we use a hash table hmap
to store the mapping from elements in the inorder
sequence to their indexes:
The figure below shows the recursive process of building the binary tree. Each node is created during the "descending" phase of the recursion, and each edge (reference) is formed during the "ascending" phase.
Each recursive function's division of the preorder
and inorder
sequences is illustrated in the figure below.
Assuming the binary tree has \(n\) nodes, initializing each node (calling the recursive function dfs()
) takes \(O(1)\) time. Therefore, the overall time complexity is \(O(n)\).
Because the hash table stores the mapping from inorder
elements to their indexes, it requires \(O(n)\) space. In the worst case, if the binary tree degenerates into a linked list, the recursive depth can reach \(n\), consuming \(O(n)\) stack space. Hence, the overall space complexity is \(O(n)\).