1 min readMar 27, 2020
Great article. I wasn’t aware of this:
“One of Chinese internet giants even modified Spark source code in order to optimally read/write Hive bucketing table :-)”
But you tease us, without providing a reference! Is the code open-source? Who did it?