
Question

Giampaolo Levorato

Asked: 2024-08-21 18:35:55 +0800 CST

How to get the coefficients from a GLM model in Python

772

I have trained the following GLM model in Python with statsmodels:

import statsmodels.api as sm
import statsmodels.formula.api as smf
fitGlm = smf.glm(listOfInModelFeatures, family=sm.families.Binomial(),
                 data=train, freq_weights=train['model_weight']).fit()

Then I printed the summary of the fitted model:

    print(fitGlm.summary())

The report reads as follows:

                 Generalized Linear Model Regression Results                  
==============================================================================
Dep. Variable:                 Target   No. Observations:              1065046
Model:                            GLM   Df Residuals:               4361436.81
Model Family:                Binomial   Df Model:                            8
Link Function:                  Logit   Scale:                          1.0000
Method:                          IRLS   Log-Likelihood:            -6.1870e+05
Date:                Wed, 21 Aug 2024   Deviance:                   1.2374e+06
Time:                        10:27:37   Pearson chi2:                 4.01e+06
No. Iterations:                     8   Pseudo R-squ. (CS):             0.1479
Covariance Type:            nonrobust                                         
===============================================================================
                  coef    std err          z      P>|z|      [0.025      0.975]
-------------------------------------------------------------------------------
Intercept       3.2619      0.003   1126.728      0.000       3.256       3.268
e1_a_11_sp      0.9318      0.004    256.254      0.000       0.925       0.939
sp_g_37         0.5850      0.006    102.522      0.000       0.574       0.596
sp_f3_35        0.6510      0.005    135.114      0.000       0.642       0.660
e1_a_07_sp      0.4930      0.006     79.698      0.000       0.481       0.505
e1_e_02_sp      0.9956      0.008    120.253      0.000       0.979       1.012
e1_b_03_sp      0.7493      0.013     56.539      0.000       0.723       0.775
e2_k_02_spa     0.4996      0.014     34.512      0.000       0.471       0.528
ea5_s_01_sp     0.3305      0.008     41.524      0.000       0.315       0.346
===============================================================================

Question: how can I get a list of the coefficients for each feature, including the intercept? That is, how do I obtain something like this?

[3.2619,0.9318,0.5850,0.6510,0.4930,0.9956,0.7493,0.4996,0.3305]

Thanks in advance.

2 Answers


Franta Marada · Answer 1 · 2024-08-21T20:20:48+08:00

Best Answer


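To extract the coefficients from a GLM fitted with statsmodels in Python, use the params attribute of the fitted results object. It returns a pandas.Series whose index holds the term names (including the intercept) and whose values are the corresponding coefficients.

You can do it like this: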

# Extract the coefficients from the fitted model
coefficients = fitGlm.params

# Convert the coefficients to a list
coefficients_list = coefficients.tolist()

# Print the list of coefficients
print(coefficients_list)

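Explanation: fitGlm.params returns a pandas.Series with the feature names as the index and the corresponding coefficients as the values. tolist() converts that Series into a plain Python list containing only the numeric coefficients. For your example, the list holds the values below; the printed output shows them at full floating-point precision rather than rounded to four decimals as in the summary: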

[3.2619, 0.9318, 0.5850, 0.6510, 0.4930, 0.9956, 0.7493, 0.4996, 0.3305]

The list has the intercept as its first value, followed by the coefficient of each feature in the order the terms appear in the model.

1

Giampaolo Levorato · Answer 2 · 2024-08-21T18:39:20+08:00


Sorted!

I needed to use something like this:

print(fitGlm.params["e1_a_11_sp"])
print(fitGlm.params["sp_g_37"])
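Because params is a pandas.Series, label-based access extends naturally to other shapes. The sketch below uses a hand-built Series as a stand-in for fitGlm.params, filled with the rounded values from the summary above (a real fit would carry more decimals), so it runs without the original data.

```python
import pandas as pd

# Stand-in for fitGlm.params, using the rounded values from the summary table.
params = pd.Series(
    [3.2619, 0.9318, 0.5850, 0.6510, 0.4930, 0.9956, 0.7493, 0.4996, 0.3305],
    index=["Intercept", "e1_a_11_sp", "sp_g_37", "sp_f3_35", "e1_a_07_sp",
           "e1_e_02_sp", "e1_b_03_sp", "e2_k_02_spa", "ea5_s_01_sp"],
)

single = params["e1_a_11_sp"]        # one coefficient, looked up by name
as_dict = params.to_dict()           # {term name: coefficient}
slopes = params.drop("Intercept")    # feature coefficients without the intercept

print(single)
print(as_dict["Intercept"])
print(slopes.tolist())
```

The dictionary form is handy when coefficients need to be exported or matched to features by name rather than by position.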

0
