Parallel processing during the backward pass
This is not a task (yet). I am just reporting on what I did.
I first tried Python's multiprocessing library for parallelizing the backward pass, then the threading library. Neither worked, and here is why.
The multiprocessing library uses os.fork() (on Linux) to create child processes. os.fork() copies the entire state of the interpreter into each child. When the processing is done, the children exit, and any data that was not explicitly returned vanishes with them.
Unfortunately, since we do not allocate new memory but instead modify the data objects of the various classes in place, any computation performed on those data elements inside a child process vanishes when that process exits. Only the objects explicitly returned are copied back to the parent process.
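The failure mode can be reproduced with a minimal sketch. `NodeData` and `backward_step` below are hypothetical stand-ins for a per-node solver data object and an in-place backward-pass computation; the point is only that the child's mutation never reaches the parent:

```python
import multiprocessing as mp

class NodeData:
    """Hypothetical stand-in for a per-node solver data object."""
    def __init__(self):
        self.grad = 0.0

def backward_step(data):
    # In-place mutation inside the child process: this write happens
    # on the child's copy of the object and is lost when it exits.
    data.grad = 42.0

def run():
    data = NodeData()
    p = mp.Process(target=backward_step, args=(data,))
    p.start()
    p.join()
    return data.grad  # still 0.0: the child worked on its own copy

if __name__ == "__main__":
    print(run())  # 0.0
```

Making the worker *return* `data` instead would work, but that is exactly option (b) below, with its pickling and copying costs.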
We have three options: a) implement our own fork() that does not copy the environment, b) return the new data elements and copy them back into the parent process, or c) use shared memory to share the data between the processes.
I did not do (a) because it would demand a lot of time. Option (b) is not really efficient, and both (b) and (c) require the data elements to be picklable, which is not the case for the Pinocchio objects.
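For reference, here is what option (c) looks like with multiprocessing's built-in shared memory. This only works for flat numeric buffers such as mp.Array; to use it here we would have to serialize the Pinocchio data into such buffers, which runs into the same picklability problem:

```python
import multiprocessing as mp

def square_into(shared, i):
    # Writes land in the shared buffer, so the parent sees them,
    # unlike writes to an ordinary Python object in a child.
    shared[i] = float(i * i)

def run():
    shared = mp.Array("d", 4)  # four doubles in shared memory
    procs = [mp.Process(target=square_into, args=(shared, i))
             for i in range(4)]
    for p in procs:
        p.start()
    for p in procs:
        p.join()
    return list(shared)

if __name__ == "__main__":
    print(run())  # [0.0, 1.0, 4.0, 9.0]
```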
The threading library does not copy memory and runs threads instead of processes, which is good. However, Python has the Global Interpreter Lock (GIL), which forces that only one thread can manipulate the interpreter's memory at any given time. Which is frustrating: you don't really run threads in parallel, you run them in sequence, with extra overhead.
This is indeed what I observed: one iteration of the biped example took 3 s single-threaded, 3.2 s with 4 threads, and 3.5 s with 8 threads.
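The effect is easy to reproduce on any CPU-bound pure-Python function (a generic sketch, not the actual biped backward pass): splitting the same total work across four threads gives no speedup, because each thread must hold the GIL to execute bytecode.

```python
import threading
import time

def cpu_bound(n, out, idx):
    # Pure-Python loop: the thread holds the GIL while it runs.
    s = 0
    for i in range(n):
        s += i
    out[idx] = s

N = 2_000_000

# Serial baseline.
t0 = time.perf_counter()
out = [0]
cpu_bound(N, out, 0)
serial = time.perf_counter() - t0

# Same total work split across 4 threads.
t0 = time.perf_counter()
outs = [0] * 4
threads = [threading.Thread(target=cpu_bound, args=(N // 4, outs, i))
           for i in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
threaded = time.perf_counter() - t0

# Under the GIL, `threaded` is no faster than `serial`,
# and is often slower due to thread-switching overhead.
print(f"serial: {serial:.3f}s, 4 threads: {threaded:.3f}s")
```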
In conclusion, if anyone wants to try out Python multiprocessing, keep these issues in mind and pick up where I left off.
Otherwise, I think we have to wait for the C++ implementation before we can think about parallel processing.