2559. Count Vowel Strings in Ranges
Problem Description
The problem presents us with an array of strings words
and a two-dimensional array queries
. We need to determine how many strings within a specific index range in the words
array start and end with a vowel. A string is said to start and end with a vowel if the first and the last character are both vowels. The range for each query is defined by two indices [l_i, r_i]
, where l_i
is the start index and r_i
is the end index. Our goal is to answer each query by counting the relevant strings and returning the results in an array ans
, where each element ans[i]
corresponds to the count for the i-th
query.
The vowel characters are specifically 'a', 'e', 'i', 'o', and 'u'. The array words
is 0-indexed, meaning the first element has an index of 0, the second an index of 1, and so on. This indexing is important to consider when processing the ranges specified in each query within queries
.
Intuition
To efficiently solve this problem, we leverage two techniques:
-
Precomputation with Cumulative Sums: The idea is to precompute the number of valid strings (strings that start and end with a vowel) up to each index in the
words
array. This is achieved by creating an auxiliary arrays
where each elements[i]
stores the cumulative count of valid strings fromwords[0]
towords[i-1]
. This allows us to quickly calculate the number of valid strings in any query range with a simple subtraction:s[r + 1] - s[l]
, wherel
andr
are the start and end indices of the query. -
Checking String Endpoints: For each string in
words
, we only need to check the first and the last character to determine if it qualifies as a valid string. We define the set of vowels and then, for each word, check if both endpoints are in that set. The Python expressionint(w[0] in vowels and w[-1] in vowels)
evaluates to1
if both conditions are true and0
otherwise.
By combining these two ideas, the solution efficiently computes the answer to each query by referencing the precomputed cumulative sum array, resulting in a significant reduction of the time complexity, especially when dealing with a large number of queries over the words
array.
Learn more about Prefix Sum patterns.
Solution Approach
The Solution
class has a method named vowelStrings
which takes in two parameters: words
, a list of strings, and queries
, a 2D list of integers. The output is a list of integers representing the counts of strings in words
that satisfy the query conditions (start and end with a vowel).
Here's a closer look at how the solution is implemented:
-
Defining a Set of Vowels: A set
vowels
is created with all the vowel characters{'a', 'e', 'i', 'o', 'u'}
. This allows for constant time complexityO(1)
for checking if a character is a vowel since set membership checking is typicallyO(1)
due to hashing. -
Calculating Cumulative Sums with
accumulate
: Theaccumulate
function from Python'sitertools
module is used to generate the cumulative sum of valid strings. For every wordw
inwords
, an expressionint(w[0] in vowels and w[-1] in vowels)
is computed which results in1
if the word starts and ends with a vowel and0
otherwise. These values are accumulated to get cumulative sums. Theinitial=0
parameter is used to start the accumulation from0
which implicitly adds an extra0
at the beginning of the accumulated list, making the cumulative sum arrays
one element longer than thewords
. This simplification helps in handling edge cases and simplifies the subtraction step in the queries. -
Handling Queries: With the cumulative sums array
s
ready, each query[l, r]
is processed to find the count of valid strings in the range by subtracting the cumulative sum up tol
from the cumulative sum up tor + 1
. This operation takes constant timeO(1)
for each query. The result of this subtraction gives the count of valid strings within the specified range of indices, which is then appended to the resulting list. -
Return the Results: A list comprehension is used to iterate through each query in
queries
, and the described subtraction operation is applied to produce the final answer list, which is then returned.
The overall time complexities for initializing the cumulative sum and for processing each query are O(N)
and O(1)
respectively, where N
is the total number of words in words
. Given Q
queries, the total time complexity for the query processing part is O(Q)
. Hence the entire solution has an overall time complexity of O(N + Q)
which is highly efficient, especially for large datasets.
All in all, this approach utilizes space-time trade-offs (precomputing the cumulative sum) and efficient data structure access patterns (using a set and direct indexing) to provide an optimal solution to the problem.
Ready to land your dream job?
Unlock your dream job with a 2-minute evaluator for a personalized learning plan!
Start EvaluatorExample Walkthrough
Let's go through a walkthrough example to illustrate the solution approach. Suppose we are given an array of strings words
and a 2D array queries
as follows:
words
: ["apple", "orange", "coop", "peach", "echo"]queries
: [[1, 3], [0, 4], [2, 2]]
First, we will start by identifying which words start and end with a vowel:
- "apple" starts with 'a' and ends with 'e', so it's valid.
- "orange" starts with 'o' and ends with 'e', so it's valid.
- "coop" does not end with a vowel, so it's not valid.
- "peach" does not start with a vowel, so it's not valid.
- "echo" starts with 'e' and ends with 'o', so it's valid.
Next, we create a cumulative sum array s
which will hold the number of valid strings at each index. It will look like this:
s
: [0, 1, 2, 2, 2, 3]
Now, we process each query:
For the query [1, 3]
, we subtract the cumulative sum at index 1
from the cumulative sum at index 3 + 1
(since s
has an extra zero at the beginning):
s[3 + 1] - s[1]
=s[4] - s[1]
=2 - 1
=1
. So there is one valid string in the range from index 1 to 3 inwords
.
For the query [0, 4]
, we do:
s[4 + 1] - s[0]
=s[5] - s[0]
=3 - 0
=3
. There are three valid strings in the range from index 0 to 4 inwords
.
For the query [2, 2]
, we do:
s[2 + 1] - s[2]
=s[3] - s[2]
=2 - 2
=0
. There are no valid strings at index 2 inwords
, because index 2 to 2 is the string "coop" which does not start and end with a vowel.
Therefore, the final result (counts of valid strings for each query) is:
ans
: [1, 3, 0]
We have used the vowels
set to identify valid strings, the accumulate
function to prepare the cumulative sum array s
, and simple subtraction for each query to get the counts efficiently. This outlines the solution approach and demonstrates how it would work with an actual set of inputs.
Solution Implementation
1from itertools import accumulate
2from typing import List
3
4class Solution:
5 def vowel_strings(self, words: List[str], queries: List[List[int]]) -> List[int]:
6 # A set of vowels is initialized for quick lookup.
7 vowels = {'a', 'e', 'i', 'o', 'u'}
8
9 # This creates a cumulative sum list which indicates the count of words that
10 # begin and end with a vowel up to the current index.
11 # 'int(w[0] in vowels and w[-1] in vowels)' evaluates to 1 when both the first
12 # and the last characters of word 'w' are vowels, contributing to the sum.
13 cumulative_count = list(
14 accumulate(
15 [int(words[i][0] in vowels and words[i][-1] in vowels) for i in range(len(words))],
16 initial=0
17 )
18 )
19
20 # For each query, calculate the number of words that start and end with a vowel
21 # in the given range by subtracting the cumulative count at the start index from
22 # the cumulative count at the end index (+1 because of the way the cumulative
23 # count is constructed).
24 results = [cumulative_count[r + 1] - cumulative_count[l] for l, r in queries]
25
26 return results
27
28# Usage
29# solution = Solution()
30# result = solution.vowel_strings(["apple", "banana", "kiwi"], [[0, 1], [1, 2]])
31# print(result) # Output would be the number of words with vowel start and end in the given ranges
32
1class Solution {
2 // Function to determine the number of words with a starting and ending vowel in given queries.
3 public int[] vowelStrings(String[] words, int[][] queries) {
4 // Set containing all the vowels for easy reference.
5 Set<Character> vowels = Set.of('a', 'e', 'i', 'o', 'u');
6
7 // The number of words in the input array.
8 int wordCount = words.length;
9
10 // An array to store the cumulative count of words that start and end with a vowel.
11 int[] cumulativeVowelCount = new int[wordCount + 1];
12
13 // Loop through each word to populate the `cumulativeVowelCount` array.
14 for (int i = 0; i < wordCount; ++i) {
15 char firstChar = words[i].charAt(0); // First character of the current word.
16 char lastChar = words[i].charAt(words[i].length() - 1); // Last character of the current word.
17
18 // Update the cumulative count. Increment if both first and last chars are vowels.
19 cumulativeVowelCount[i + 1] = cumulativeVowelCount[i] + (vowels.contains(firstChar) && vowels.contains(lastChar) ? 1 : 0);
20 }
21
22 // The number of queries to process.
23 int queryCount = queries.length;
24 // Array to hold the answers to each query.
25 int[] answer = new int[queryCount];
26
27 // Process each query to get the number of words that start and end with a vowel.
28 for (int i = 0; i < queryCount; ++i) {
29 int leftIndex = queries[i][0], rightIndex = queries[i][1]; // Extracting the range from the query.
30
31 // Answer for this query is the difference in the cumulative counts.
32 answer[i] = cumulativeVowelCount[rightIndex + 1] - cumulativeVowelCount[leftIndex];
33 }
34
35 // Return the array containing the answer to each query.
36 return answer;
37 }
38}
39
1#include <vector>
2#include <string>
3#include <unordered_set>
4using namespace std;
5
6class Solution {
7public:
8 // Function to calculate the number of strings in subarrays that start and end with vowels.
9 vector<int> vowelStrings(vector<string>& words, vector<vector<int>>& queries) {
10 // Define the set of vowels for easy checking later.
11 unordered_set<char> vowels = {'a', 'e', 'i', 'o', 'u'};
12
13 // Get the size of the words array for future loops.
14 int numWords = words.size();
15
16 // Initialize a prefix sum array with an extra element for simplifying calculations.
17 vector<int> prefixSum(numWords + 1, 0);
18
19 // Build the prefix sum array with the count of words starting and ending with a vowel.
20 for (int i = 0; i < numWords; ++i) {
21 char firstChar = words[i][0], lastChar = words[i].back();
22 prefixSum[i + 1] = prefixSum[i] + (vowels.count(firstChar) && vowels.count(lastChar));
23 }
24
25 // Prepare a vector to store answers for queries.
26 vector<int> ans;
27
28 // Iterate through the queries to calculate the result.
29 for (auto& query : queries) {
30 // Retrieve the left and right boundaries for this query.
31 int leftBoundary = query[0], rightBoundary = query[1];
32 // Calculate the result for the current query using prefix sums.
33 // This gives the number of strings with first and last vowels in the range [l..r]
34 ans.push_back(prefixSum[rightBoundary + 1] - prefixSum[leftBoundary]);
35 }
36
37 // Return the final answers for all the queries.
38 return ans;
39 }
40};
41
1// Function to calculate the number of strings within a given range
2// where both the first and last characters are vowels
3function vowelStrings(words: string[], queries: number[][]): number[] {
4 // Create a set of vowel characters for easy checking
5 const vowels = new Set(['a', 'e', 'i', 'o', 'u']);
6 // Get the length of the words array
7 const wordsCount = words.length;
8 // Initialize an array to store the cumulative count of words
9 // that start and end with vowels
10 const cumulativeVowelCounts: number[] = new Array(wordsCount + 1).fill(0);
11
12 // Iterate over the words to populate the cumulative count array
13 for (let index = 0; index < wordsCount; ++index) {
14 // Check if both the first and last character of the current word are vowels
15 if (vowels.has(words[index][0]) && vowels.has(words[index][words[index].length - 1])) {
16 // Increment the cumulative count at the next index
17 cumulativeVowelCounts[index + 1] = cumulativeVowelCounts[index] + 1;
18 } else {
19 // If not, just carry over the previous cumulative count
20 cumulativeVowelCounts[index + 1] = cumulativeVowelCounts[index];
21 }
22 }
23
24 // Use map to transform the queries into the result by taking the
25 // difference of cumulative counts, thus obtaining the count for each range
26 return queries.map(([left, right]) => {
27 // Subtract the cumulative count at the start of the range from the
28 // cumulative count at the end of the range to get the count of valid words within the range
29 return cumulativeVowelCounts[right + 1] - cumulativeVowelCounts[left];
30 });
31}
32
Time and Space Complexity
Time Complexity
The time complexity of the code mainly comes from three parts:
- Creating the list comprehension which checks every word in the input list
words
. This process is O(n) where n is the number of words inwords
. - The use of
accumulate
from theitertools
module to create a prefix sum array of the boolean values obtained from the list comprehension. Sinceaccumulate
goes through each element in the provided list, this is O(n). - Finally, answering the queries involves iterating over each query and performing constant-time subtraction for the prefix sums. If there are
q
queries, this part has a complexity of O(q).
Combining these gives a total time complexity of O(n + q).
Space Complexity
The space complexity considerations include:
- Storage of the boolean list comprehension which requires O(n) space.
- The prefix sum list
s
which also needs O(n) space. - The space required for the output list which will store
q
integers, hence O(q) space.
The total space complexity is thus the sum of the above, which is O(n + q).
Learn more about how to find time and space complexity quickly using problem constraints.
A person thinks of a number between 1 and 1000. You may ask any number questions to them, provided that the question can be answered with either "yes" or "no".
What is the minimum number of questions you needed to ask so that you are guaranteed to know the number that the person is thinking?
Recommended Readings
Prefix Sum The prefix sum is an incredibly powerful and straightforward technique Its primary goal is to allow for constant time range sum queries on an array What is Prefix Sum The prefix sum of an array at index i is the sum of all numbers from index 0 to i By
LeetCode Patterns Your Personal Dijkstra's Algorithm to Landing Your Dream Job The goal of AlgoMonster is to help you get a job in the shortest amount of time possible in a data driven way We compiled datasets of tech interview problems and broke them down by patterns This way we
Recursion Recursion is one of the most important concepts in computer science Simply speaking recursion is the process of a function calling itself Using a real life analogy imagine a scenario where you invite your friends to lunch https algomonster s3 us east 2 amazonaws com recursion jpg You first
Want a Structured Path to Master System Design Too? Don’t Miss This!