apache spark - How many RDDs does DStream generate for a batch interval?
Does one batch interval of data generate one, and only one, RDD in a DStream, regardless of how much data arrives?
Yes, there is exactly one RDD per batch interval. It is produced at every batch interval, independent of the number of records it contains (there may even be 0 records in it).
If that were not the case, and RDD creation were conditioned on the number of elements, you wouldn't have synchronous (micro-batching) streaming, but rather a form of asynchronous processing.
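To make the time-driven behaviour concrete, here is a small toy model in plain Python. It is not Spark code and not how Spark is implemented; the function `micro_batch` and the sample events are made up for illustration. It only shows the idea that batches are cut by the clock, so an interval with no data still yields an (empty) batch.

```python
from typing import List, Tuple


def micro_batch(
    events: List[Tuple[float, str]],
    batch_interval: float,
    duration: float,
) -> List[List[str]]:
    """Group (timestamp, record) events into one batch per interval.

    Hypothetical helper: one batch is created for every interval in
    [0, duration), whether or not any event falls inside it.
    """
    num_batches = int(duration // batch_interval)
    # Pre-create every batch up front: creation depends on time, not on data.
    batches: List[List[str]] = [[] for _ in range(num_batches)]
    for timestamp, record in events:
        index = int(timestamp // batch_interval)
        if 0 <= index < num_batches:
            batches[index].append(record)
    return batches


# Three records arrive in the first second, none in the next two, one in the last.
events = [(0.2, "a"), (0.7, "b"), (0.9, "c"), (3.1, "d")]
batches = micro_batch(events, batch_interval=1.0, duration=4.0)

for i, batch in enumerate(batches):
    print(f"interval {i}: {len(batch)} record(s) {batch}")
```

Four intervals give four batches, two of them empty. In Spark Streaming itself, the function passed to `foreachRDD` is likewise invoked for each interval's RDD, and a common pattern is to check `rdd.isEmpty()` inside it to skip work for empty batches.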