Suppose a program is running on a distributed-memory multiprocessor. There are 1,000 instructions in the program and 80% of instruction references hit in the local memory and 20% of instruction references involve the remote communication in the remote memory. 10 ns time is required for the remote communication for each instruction reference. If the CPU clock rate is 4 GHZ and CPI is 0.5, what is the running time of this program?