-
Notifications
You must be signed in to change notification settings - Fork 60
switch to new Hadoop API #18
Comments
See https://github.com/butlermh/behemoth/commit/7411aa9cbd0fd1bddd61545a9a503daff5d8dcf8 It turns out updating to the new API is a bad idea, DistributedCache does not work with the new API - see Also for the IO module, for WARC, not quite sure how to deal with MultiFileSplits. This has been replaced by CombineFileSplit, however it still implements the interface InputSplit, whereas the new api uses a class called InputSplit. So not clear how this needs to change either. |
In the end, I did manage to find a way of doing this, except for WARC - see https://github.com/butlermh/behemoth/commit/97150bd579ae74eefacae85422937698f2c72445 |
No description provided.
The text was updated successfully, but these errors were encountered: