
Date: 2025/7/13 17:02:45  Source: https://blog.csdn.net/qq_61706514/article/details/144138525

Diagram:

Code:

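import torch.nn as nn


class Mlp(nn.Module):
    """MLP as used in Vision Transformer, MLP-Mixer and related networks"""

    def __init__(self,
                 in_features,            # dimension of the input features
                 hidden_features=None,   # dimension of the hidden layer; defaults to None
                 out_features=None,      # dimension of the output features; defaults to None
                 act_layer=nn.GELU,      # activation layer; defaults to nn.GELU
                 drop=0.):               # dropout rate; the default of 0 disables dropout
        super().__init__()
        # If out_features is not specified, it defaults to in_features.
        out_features = out_features or in_features
        # If hidden_features is not specified, it defaults to in_features.
        hidden_features = hidden_features or in_features
        self.fc1 = nn.Linear(in_features, hidden_features)
        self.act = act_layer()  # nn.GELU by default
        self.fc2 = nn.Linear(hidden_features, out_features)
        self.drop = nn.Dropout(drop)

    def forward(self, x):
        x = self.fc1(x)   # project to the hidden dimension
        x = self.act(x)   # apply the non-linearity
        x = self.drop(x)  # dropout after the activation
        x = self.fc2(x)   # project to the output dimension
        x = self.drop(x)  # dropout after the second linear layer
        return x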
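
To make the default activation concrete: with its default settings, nn.GELU computes x * Φ(x), where Φ is the standard normal CDF. The snippet below is only a minimal pure-Python sketch of that formula on scalars, compared against ReLU. The helper names gelu and relu are illustrative and are not part of the original code.

```python
import math


def gelu(x: float) -> float:
    """Exact GELU: x * Phi(x), where Phi is the standard normal CDF."""
    return 0.5 * x * (1.0 + math.erf(x / math.sqrt(2.0)))


def relu(x: float) -> float:
    """ReLU for comparison: zero for negative inputs, identity otherwise."""
    return max(0.0, x)


# Print both activations side by side on a few sample inputs.
for v in (-3.0, -1.0, 0.0, 1.0, 3.0):
    print(f"x={v:+.1f}  gelu={gelu(v):+.4f}  relu={relu(v):+.4f}")
```

Unlike ReLU, GELU lets small negative inputs through as small negative outputs and approaches the identity for large positive inputs.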

Advantages of the GELU function: