Apple Metal Performance

Apple Metal Performance | Original

Home 2025.01

Below is a script to test Metal (GPU) performance using PyTorch.

import torch
import argparse
import time

parser = argparse.ArgumentParser(description="Test torch with MPS or CPU.")
parser.add_argument("--device", type=str, default="mps", choices=["mps", "cpu"], help="Device to use (mps or cpu)")
args = parser.parse_args()

if args.device == "mps":
    if torch.backends.mps.is_available():
        print("Metal is available")
        device = torch.device("mps")
    else:
        print("Metal is not available, using CPU instead")
        device = torch.device("cpu")
elif args.device == "cpu":
    device = torch.device("cpu")
    print("Using CPU")
else:
    print("Invalid device specified, using CPU instead")
    device = torch.device("cpu")


# Create a tensor and move it to the specified device
x = torch.randn(5000, 5000, device=device)
y = torch.randn(5000, 5000, device=device)

# Perform a more complex computation
start_time = time.time()
result = torch.matmul(x, y)
for _ in range(10):
    result = torch.matmul(result, y)
end_time = time.time()

# Print the result
print(result)
print(f"Time taken: {end_time - start_time:.4f} seconds")

The results demonstrate that MPS is significantly faster than CPU. The execution time on MPS is only about 0.2% of the CPU time.

% python scripts/test_metal.py --device cpu
Time taken: 2.8784 seconds

% python scripts/test_metal.py --device mps
Time taken: 0.0061 seconds

Back Donate