Open
Description
As of right now multiple queues create multiple contexts. Is this intended to always be the case or will this be moved to streams over time? If I have understood correctly openCL queues are akin to CUDA streams so that's why I'm asking. If it will continue to be contexts what would be some ways to have them utilize the GPU in a good way. MPS maybe, or something else. But it feels wierd having to do a workaround for a core feature.