Is Hadoop in Docker container faster/worth it? -


i have hadoop based environment. use flume, hue , cassandra in system. there big hype around docker nowadays, examine, pros , cons in dockerization in case. think should more portable, can set using cloudera manager few clicks. maybe faster or why worth it? advantages? maybe should multi node cassandra cluster dockerized?

is maybe faster or why worth it?

it sounds have hadoop cluster. have ask yourself, how long take reproduce environment? how need reproduce environment?

if not needing way reproduce environment repeatedly , and contain dependencies may conflicts other applications on host, don't yet see use case you.

what advantages?

if running hadoop in environment may need mixed java versions, running container isolate dependencies (in case, java) host system. in case, more reproducible artifact move around , set up. java apps simple dependencies included in jar.

maybe should multi node cassandra cluster dockerized?

i don't think comes down whether is multi-node environment or not. comes down problems solves. doesn't sound have pain point in deploying or reproducing hadoop environments (yet), don't see need "dockerize" because hot new thing on block.

when have need reproduce hadoop environment easily, might @ docker of orchestration , management tools (kubernetes, rancher, etc.) make deploying , managing clusters of applications on overlay network more appetizing regular docker. docker tool in eyes. starts shine when can leverage of neat overlay multi-host networking, discovery, , orchestration other packages building on top of it.


Comments

Popular posts from this blog

sublimetext3 - what keyboard shortcut is to comment/uncomment for this script tag in sublime -

java - No use of nillable="0" in SOAP Webservice -

ubuntu - Laravel 5.2 quickstart guide gives Not Found Error -