mapreduce - Can I use Hadoop's distributed cache in standalone mode? -


i'm writing java mapreduce program in hadoop, , every reducer needs have access static xml file (that contains business logic applying rules value lists). i've decided store file in distributed cache , utilize distributed cache api access (serialize it) within every reducer. haven't implemented yet, api seems straightforward when comes using functionality in hdfs/distributed mode. however, possible utilize in pseudo-distributed standalone mode testing purposes? how work?

also, bad idea serialize file in every reducer? i'm open suggestions on other ways distributed "global static data" reducers.

thanks!

yes can - work same way in real cluster.

this idea use distributed cache if isn't fixed. other alternative pack data jar contains job's code in case wouldn't flexible when have change xml (you can still change jar that's not clean way go it)


Comments