We define the group-based collective keyword (GBCK) query problem.
We show the problem of answering GBCK query is NP-hard.
An exact algorithm for answering the GBCK queries is presented.
An approximation algorithm with a 15/7-factor approximation is proposed.
A number of experiments on real datasets are done.