-
Notifications
You must be signed in to change notification settings - Fork 83
Revised Unified Virtual Memory(UVM) support #107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: dev
Are you sure you want to change the base?
Conversation
- Full functional implementation; Performance/time simulation pending - CUDA APIs for cudaMallocManaged, cudaMemPrefetchAsync, and cudaDeviceSynchronize - Data-structures to maintain mapping between CPU and GPU side memory during cudaMallocManaged - Track mapping during cudaSetupArgument to override CPU memory pointer by GPU side memory pointer
- Same benchmark with cudaMalloc and cudaMemcpy, i.e., old APIs - Same benchmark with cudaMallocManaged, i.e., new UVM API
- Adding PCI-e latency as part of config (value subject to change based on architecture) - Parsing logic of latency and setting clock domain - Adding valid flag as part of page table implementation
This reverts commit f708eb3.
… gddr size constraint
- TLB look up - page table access/walk latency - multi-lane bidirectional PCI-E with far fetch latency
- Don't process if warp instruction's access queue is empty
- Change input size for bfs, nw, pathfinder, srad_v2 etc. to finish in reasonable time - Add pathfinder benchmark for managed code - Fixing BFS managed code
…ze, and a bollean flag to denote whether data is copied from cpu to gpu on first kernel launch
…llocations from GPU to CPU
… queue is ready to complete. Determine based on whether dispatched warps from all SMs are stalling for PCI-E transfer and no progress can be made by any component of the simulator.
…ion write back or device synchronize check for allocations which were copied at the first place during kernel launch from CPU to GPU
…nslated to managed code
- kernel by kernel, data structure wise basic block access - need to implement actual detection logic and policy engine
- remaining policy engine
- from experiments on smart adaptive runtime with dynamic migration pattern detection and memory management - along with validation results
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@yechen3 - one weird side-effect of the integration changes is that you ended up re-adding a bunch of files that have since been deleted. Jenkinsfile, bitbucket*, all the old configs in the old folders outside of tested and deprecated.
Please clean this up. Make sure you are only adding files that are actually added for this to work. Otherwise, the directory structure should be the same as the mainline dev.
@tgrogers Got it. I've already cleaned things up, so everything should be in order now. Let me know if there's anything else that needs attention. |
Same to PR#85, but this has a cleaner commit history.