06. Search Pattern (Rabin-Karp Algorithm)
β GFG solution for Rabin-Karp based substring search. Fast, efficient pattern matching using rolling hash! π
The problem can be found at the following link: π Question Link
π§© Problem Description
Given two strings:
- A text string in which to search. 
- A pattern string to find. 
Return all starting positions (1-based) where the pattern occurs in the text. Use the Rabin-Karp Algorithm (rolling hash-based pattern matching).
π Examples
Example 1
Input: text = "birthdayboy", pattern = "birth"
Output: [1]Example 2
Input: text = "geeksforgeeks", pattern = "geek"
Output: [1, 9]π Constraints
- $1 \leq \text{len(text)} \leq 5 \times 10^5$ 
- $1 \leq \text{len(pattern)} \leq \text{len(text)}$ 
- All characters are lowercase English letters ( - a-z)
β
 My Approach
The Rabin-Karp Algorithm uses hashing to match the pattern with substrings in the text.
Hashing Basics
- Convert strings to numerical hash using a rolling hash formula. 
- If two hashes match, perform a direct string comparison to confirm (to avoid false positives from collisions). 
Algorithm Steps:
- Hash Function: Use a prime modulus - qand radix/base- d = 256(covers lowercase letters).
- Precompute Hashes: - Compute the hash of the - patternand the initial window in- text.
- Use a rolling hash to update hash as the window slides by 1 character. 
 
- Match Check: - When hash matches, check characters one by one to confirm. 
 
π Time and Auxiliary Space Complexity
- Expected Time Complexity: O(N + M), where - Nis text length and- Mis pattern length. We compute hashes in linear time and only compare strings when hashes match.
- Expected Auxiliary Space Complexity: O(1), as we only use a fixed number of variables for hashing. 
π§βπ» Code (C++)
class Solution {
  public:
    vector<int> search(string pat, string txt) {
        int d = 256, q = 101, m = pat.length(), n = txt.length();
        int ph = 0, th = 0, h = 1;
        vector<int> res;
        for (int i = 0; i < m - 1; i++) h = (h * d) % q;
        for (int i = 0; i < m; i++) {
            ph = (d * ph + pat[i]) % q;
            th = (d * th + txt[i]) % q;
        }
        for (int i = 0; i <= n - m; i++) {
            if (ph == th) {
                bool match = true;
                for (int j = 0; j < m; j++)
                    if (txt[i + j] != pat[j]) { match = false; break; }
                if (match) res.push_back(i + 1);
            }
            if (i < n - m) {
                th = (d * (th - txt[i] * h) + txt[i + m]) % q;
                if (th < 0) th += q;
            }
        }
        return res;
    }
};π§βπ» Code (Java)
class Solution {
    ArrayList<Integer> search(String pat, String txt) {
        int d = 256, q = 101, m = pat.length(), n = txt.length();
        int ph = 0, th = 0, h = 1;
        ArrayList<Integer> res = new ArrayList<>();
        for (int i = 0; i < m - 1; i++) h = (h * d) % q;
        for (int i = 0; i < m; i++) {
            ph = (d * ph + pat.charAt(i)) % q;
            th = (d * th + txt.charAt(i)) % q;
        }
        for (int i = 0; i <= n - m; i++) {
            if (ph == th) {
                boolean match = true;
                for (int j = 0; j < m; j++) {
                    if (txt.charAt(i + j) != pat.charAt(j)) {
                        match = false;
                        break;
                    }
                }
                if (match) res.add(i + 1);
            }
            if (i < n - m) {
                th = (d * (th - txt.charAt(i) * h) + txt.charAt(i + m)) % q;
                if (th < 0) th += q;
            }
        }
        return res;
    }
}π Code (Python)
class Solution:
    def search(self, pat, txt):
        d, q, m, n = 256, 101, len(pat), len(txt)
        ph = th = 0; h = 1; res = []
        for i in range(m - 1): h = (h * d) % q
        for i in range(m):
            ph = (d * ph + ord(pat[i])) % q
            th = (d * th + ord(txt[i])) % q
        for i in range(n - m + 1):
            if ph == th and txt[i:i + m] == pat:
                res.append(i + 1)
            if i < n - m:
                th = (d * (th - ord(txt[i]) * h) + ord(txt[i + m])) % q
                if th < 0: th += q
        return resπ§  Contribution and Support
For discussions, questions, or doubts related to this solution, feel free to connect on LinkedIn: π¬ Any Questions?. Letβs make this learning journey more collaborative!
β If you find this helpful, please give this repository a star! β
πVisitor Count
Last updated