This issue tracks the proposal to evaluate Apache Arrow IPC as an alternative on-disk data format for the MLPerf Storage reader pipeline, with the goal of improving data loading efficiency and reducing CPU overhead during sample deserialization.