Load distribution on a cluster | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

		Load distribution on a cluster
		1 point by liverpoolfan on Aug 19, 2010 \| hide \| past \| favorite \| 2 comments

		Hi, A computer cluster is composed of nodes that execute jobs on files stored in that node (data locality optimization). Certain files have more jobs assigned to them than others. Let's say we have: - file A with load X - file B with load X - file C with load 2X - two nodes in the cluster So the best distribution is: file A and file B in one node and file C in the other node. How can I distribute the files in the nodes? Does a greedy algorithm solve my problem?

lzw on Aug 19, 2010 [–]

I think maybe you are looking for hadoop or ore map reduce based system? Possibly i misunderstood your question.

liverpoolfan on Aug 20, 2010 | [–]

A job could be a map/reduce operation.

In this case, my question would be:

How does a map/reduce system redistribute the chunks to balance the load if some chunks are more accessed than others?

Consider applying for YC's Spring batch! Applications are open till Feb 11.
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact