apache spark - How many RDDs does DStream generate for a batch interval? -


does 1 batch interval of data generate one , one rdd in dstream regardless of how big quantity of data?

yes, there 1 rdd per batch interval, produced @ every batch interval independent of number of records (that included in rdd -- there 0 records inside).

if there wasn't, , rdd creation conditioned on number of elements, wouldn't have synchronous (micro-batching) streaming, rather form of asynchronous processing.


Comments

Popular posts from this blog

Delphi XE2 Indy10 udp client-server interchange using SendBuffer-ReceiveBuffer -

Qt ActiveX WMI QAxBase::dynamicCallHelper: ItemIndex(int): No such property in -

Enable autocomplete or intellisense in Atom editor for PHP -