
PD_Sathya

Asked: 2024-10-20 07:01:30 +0800 CST

Is this benchmark valid? tinygrad is far faster than torch or numpy for medium-sized (10000 x 10000) matrix multiplication on CPU


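I ran the following benchmark code on a Google Colab CPU runtime with high RAM enabled. Please point out any mistakes in my benchmarking procedure, if there are any, and explain why tinygrad shows such a large performance gain.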

import time

import numpy as np
import torch
from tinygrad import Tensor

# Set the size of the matrices
size = 10000

# Generate a random 10000x10000 matrix with NumPy
np_array = np.random.rand(size, size)

# Generate a random 10000x10000 matrix with PyTorch
torch_tensor = torch.rand(size, size)

# Generate a random 10000x10000 matrix with TinyGrad
tg_tensor = Tensor.rand(size, size)  

# Benchmark NumPy
start_np = time.time()
np_result = np_array @ np_array  # Matrix multiplication
np_time = time.time() - start_np
print(f"NumPy Time: {np_time:.6f} seconds")

# Benchmark PyTorch
start_torch = time.time()
torch_result = torch_tensor @ torch_tensor  # Matrix multiplication
torch_time = time.time() - start_torch
print(f"PyTorch Time: {torch_time:.6f} seconds")

# Benchmark TinyGrad
start_tg = time.time()
tg_result = tg_tensor @ tg_tensor  # Matrix multiplication
tg_time = time.time() - start_tg
print(f"TinyGrad Time: {tg_time:.6f} seconds")

NumPy Time: 11.977072 seconds
PyTorch Time: 7.905509 seconds
TinyGrad Time: 0.000607 seconds

These are the results. They were very similar across multiple runs of the code.

1 Answer

jared · Answer 1 · 2024-10-20T09:09:47+08:00

Best Answer


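Tinygrad executes operations lazily, so the matrix multiplication has not actually been performed at the point where the timer stops. Change the matrix multiplication line to: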

tg_result = (tg_tensor @ tg_tensor).realize()

or

tg_result = (tg_tensor @ tg_tensor).numpy()
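
To illustrate why the original timing came out near zero, here is a minimal pure-Python sketch of lazy evaluation. `LazyMatmul` and `timed` are hypothetical names made up for this example. This is a toy model of the general idea, not tinygrad's actual implementation.

```python
import time


class LazyMatmul:
    """Toy stand-in for a lazy op: stores its operands, computes only on demand."""

    def __init__(self, a, b):
        self.a, self.b = a, b  # only references are stored; no arithmetic yet
        self.computed = False
        self.value = None

    def realize(self):
        # The O(n^3) multiplication runs only when realize() is called.
        if not self.computed:
            cols = list(zip(*self.b))
            self.value = [
                [sum(x * y for x, y in zip(row, col)) for col in cols]
                for row in self.a
            ]
            self.computed = True
        return self.value


def timed(fn):
    """Return (result, elapsed seconds) for a single call to fn."""
    start = time.perf_counter()
    result = fn()
    return result, time.perf_counter() - start


n = 60
m = [[float(i + j) for j in range(n)] for i in range(n)]

# Timing only the construction measures bookkeeping, not the multiplication.
lazy, build_time = timed(lambda: LazyMatmul(m, m))
computed_after_build = lazy.computed

# Timing the forced evaluation measures the real work.
product, compute_time = timed(lazy.realize)

print(f"build only: {build_time:.6f} s, computed={computed_after_build}")
print(f"realize():  {compute_time:.6f} s, computed={lazy.computed}")
```

The same reasoning applies to the benchmark in the question: the forcing call (`.realize()` or `.numpy()`) has to sit between the two `time.time()` calls, otherwise only the cost of building the expression is measured.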

