CSE3358 Problem Set 6: K-Way Merging, Sorting, and Hashing | Exams Data Structures and Algorithms

CSE3358 Problem Set 6

Practice Problems

02/22/05

Due 02/25/05

Decision trees and lower bounds

Problem 1: Merging k sorted lists

We have so far considered a number of variations to this problem. Through our study of merge sort, we

saw how to merge two sorted lists containing a total of n elements in Θ(n) time. Moreover, Problem

Set 2 introduced a modification to merge sort that calls insertion sort on small inputs (≤ k in size).

This modification of merge sort can be viewed essentially as merging n/k sorted lists of size k each.

We have seen how to do this in Θ(n log(n/k)) time.

Now we will revisit the problem in a generalized form: We consider having k lists containing a total of

n elements. We need to merge them into one sorted list. One could generalize the two-way merging

to k-way merging in the following way: We start with an empty list which will eventually contain

all n elements in order. We keep k pointers, one per list. Each pointer originally points to the first

element of the corresponding list. By comparing all k elements, the smallest among these is chosen

and placed in the big list and the corresponding pointer is incremented. This procedure is repeated

until all elements are taken. The drawback of this approach is that determining the smallest among

k elements takes O(k) time, leading to an O(nk) running time for the merging. A better way is to

perform 2-way merging of two lists in a tree-like form as we did previously for the modified merge sort.
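
The pointer-based procedure described above can be sketched as follows. This is only a minimal illustration in Python, assuming the k lists are given as Python lists already sorted in ascending order; the function name naive_k_way_merge is hypothetical and not part of the problem set. Each output element costs a scan over up to k list heads, which is where the O(nk) bound comes from.

```python
def naive_k_way_merge(lists):
    """Merge k sorted lists by scanning all k current heads per output element."""
    pointers = [0] * len(lists)          # one pointer per list, at its first element
    total = sum(len(lst) for lst in lists)
    merged = []
    for _ in range(total):
        best = None                      # index of the list whose head is smallest
        for i, lst in enumerate(lists):  # O(k) scan over the current heads
            if pointers[i] < len(lst):   # skip lists that are already exhausted
                if best is None or lst[pointers[i]] < lists[best][pointers[best]]:
                    best = i
        merged.append(lists[best][pointers[best]])
        pointers[best] += 1              # advance the pointer of the chosen list
    return merged
```

For example, naive_k_way_merge([[1, 4, 7], [2, 5], [3, 6, 8, 9]]) returns the nine elements in ascending order.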

Here’s another way:

(a) Using a heap data structure, describe an O(n log k) time algorithm for performing k-way merging

of k lists containing a total of n elements.

(b) Show that any algorithm that merges k sorted lists containing a total of n elements, and that uses

those elements in comparisons only, has to run in Ω(n log k) time. To do this, count all possible

interleavings of k sorted lists and use a decision tree argument similar to the one we used to obtain

the sorting lower bound.

Note: Merge sort is nothing but merging n lists of size 1 each.

Problem 2: Yet another variation

Consider k lists that are not necessarily sorted containing a total of n elements. In this variation, all

the elements in the first list are less than or equal to all the elements in the second list, and so on...

One possible method for sorting the elements is to sort the individual lists independently and then

concatenate the sorted results. This will take Θ(Σ_i n_i log n_i) where n_i is the number of elements in

list i.

Algorithm OBVIOUS

for i ← 1 to k

    do sort list i    ▷ for example, using merge sort

concatenate the lists
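
As a rough illustration, Algorithm OBVIOUS might look like this in Python. The function name obvious_sort is hypothetical, and the sketch assumes the input already satisfies the problem's precondition (every element of list i is less than or equal to every element of list i+1). It uses Python's built-in sorted, which is not merge sort but also runs in O(n_i log n_i) time on a list of n_i elements.

```python
def obvious_sort(lists):
    """Sort each list independently, then concatenate the sorted results in order."""
    result = []
    for lst in lists:               # for i <- 1 to k
        result.extend(sorted(lst))  # sort list i, then append it to the output
    return result
```

For example, obvious_sort([[2, 1, 3], [5, 4], [9, 7, 8]]) yields a fully sorted list, because the ordering between lists is guaranteed by the precondition.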

Although the above algorithm is OBVIOUS, nothing better can be done. Regardless of what algorithm

we use for this variation of the sorting problem, show that Ω(Σ_i n_i log n_i) time is needed. Note that
