1. First, analyze the algorithm's parts and steps. Ex: the point-of-interest algorithm consists of five parts, which we divide into 9 steps.
2. Program the SPE so that each SPE can do the whole job; what it actually does depends on the worknum sent to it. Ex: a single SPE can then do the whole job by itself, as a single CPU does in the x86 model.
3. Define the partitioning and the size of each piece of work. Ex: here we split the work into 8 partitions, so 8*9 = 72 steps need to be processed.
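
The three points above can be sketched in plain C. This is only an illustration under stated assumptions, not actual SPE code: it assumes that worknum selects one of the 8 partitions and that a worker runs all 9 steps on that partition. The names `work_range`, `partition_for`, `run_step`, `run_worker` and `run_all_on_one_worker` are hypothetical, and `run_step` is a placeholder that only counts the work units it is given.

```c
#include <assert.h>
#include <stddef.h>

/* Values taken from the text: 9 steps, work split into 8 partitions. */
enum { NUM_STEPS = 9, NUM_PARTITIONS = 8 };

/* Hypothetical half-open range [begin, end) of items owned by one partition. */
typedef struct {
    size_t begin;
    size_t end;
} work_range;

/* Split total_items into NUM_PARTITIONS nearly equal ranges and return
 * the one selected by worknum (assumption: worknum is a partition index). */
work_range partition_for(size_t total_items, int worknum)
{
    assert(worknum >= 0 && worknum < NUM_PARTITIONS);

    size_t base  = total_items / NUM_PARTITIONS;  /* minimum items per partition */
    size_t extra = total_items % NUM_PARTITIONS;  /* first `extra` partitions get one more */
    size_t w     = (size_t)worknum;

    work_range r;
    r.begin = w * base + (w < extra ? w : extra);
    r.end   = r.begin + base + (w < extra ? 1 : 0);
    return r;
}

/* Placeholder for one algorithm step on one partition.
 * A real step would process items r.begin .. r.end - 1; here we only count. */
static void run_step(int step, work_range r, int *units_done)
{
    (void)step;
    (void)r;
    ++*units_done;
}

/* What every worker runs: the whole 9-step job, restricted to the
 * partition named by worknum. Returns the number of work units done. */
int run_worker(size_t total_items, int worknum)
{
    work_range r = partition_for(total_items, worknum);
    int units_done = 0;

    for (int step = 0; step < NUM_STEPS; ++step) {
        run_step(step, r, &units_done);
    }
    return units_done;
}

/* Single-worker case: one worker is handed every worknum in turn,
 * so it performs all 8 * 9 work units by itself. */
int run_all_on_one_worker(size_t total_items)
{
    int total = 0;

    for (int worknum = 0; worknum < NUM_PARTITIONS; ++worknum) {
        total += run_worker(total_items, worknum);
    }
    return total;
}
```

Because every worker carries the whole program, the same `run_worker` could be handed a different worknum on each of 8 workers, or one worker could loop over all worknums as `run_all_on_one_worker` does. Either way the total is 8*9 work units.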