I am using Longformer for a binary sequence-classification problem.
I have already downloaded the required files.
# load model and tokenizer and define length of the text sequence
from transformers import LongformerForSequenceClassification

model = LongformerForSequenceClassification.from_pretrained('allenai/longformer-base-4096',
                                                             gradient_checkpointing=False,
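The snippet above is cut off; below is a minimal sketch of the tokenizer setup and forward pass that usually go with it. The 1024-token length and the padding/truncation choices are my assumptions, not values from the question.

from transformers import LongformerTokenizerFast
import torch

tokenizer = LongformerTokenizerFast.from_pretrained('allenai/longformer-base-4096')

# encode one document and run it through the classification model
encoding = tokenizer("some long document text", max_length=1024,
                     padding="max_length", truncation=True, return_tensors="pt")
with torch.no_grad():
    output = model(input_ids=encoding["input_ids"],
                   attention_mask=encoding["attention_mask"])
print(output.logits)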
I want to apply this approach to a Bi-LSTM; the method is discussed here: Bi-LSTM Attention model in Keras. I get the following error: 'module' object is not callable. It fails on the line that applies the multiplication: sent_representation = merge([lstm, attention], mode='mul'), with the import from keras.layers import merge.
import tensorflow as tf
from tensorflow.keras.layers import Concatenate, Dense,
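A minimal sketch of one possible fix (my assumption, not confirmed by the question): in current tf.keras the old merge(...) function has been removed, so the element-wise product of the LSTM output and the attention weights can be done with the Multiply layer instead. lstm and attention below are the tensors from the linked model.

from tensorflow.keras.layers import Multiply

# element-wise product, replacing merge([lstm, attention], mode='mul')
sent_representation = Multiply()([lstm, attention])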
How can I print the Mutt mail client's inbox list to a text file?
Here is my problem:
I need to print the list of inbox messages, which are displayed with the following setting:
set index_format="%4C %Z %{%d/%m/%y %H:%M} %s"
So I need to print them to a text file, with contents like this example:
10 N F 08/07/19 08:53 Attention: alarm(14286247:motion detection)
11 N F 08/07/19 08:53 Attention: alarm(14033396:motion detection)
12 N F 08/07/19 08:53 A
I am using a Swin Transformer for a hierarchical multi-valued, multi-label classification problem. I would like to visualize the self-attention maps on my input image by extracting them from the model, but unfortunately I have not managed to do it. Could you tell me how? Below is the part of my code where I try to accomplish this.
attention_maps = []
for module in model.modules():
    # print(module)
    if hasattr(module, 'attention_patches'):  # check whether the module has the attribute
        print(module.attention_patches)
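A minimal sketch of an alternative approach using forward hooks (my own suggestion, not from the question), assuming a timm-style Swin implementation in which each WindowAttention module applies a dropout layer named attn_drop to the softmaxed attention weights; hooking that layer captures the per-window attention maps. input_image is a hypothetical placeholder for your preprocessed input tensor.

import torch

attention_maps = []

def save_attention(module, inputs, output):
    # the output of attn_drop is the softmaxed attention matrix (dropout is a no-op in eval mode)
    attention_maps.append(output.detach().cpu())

hooks = []
for name, module in model.named_modules():
    if name.endswith("attn.attn_drop"):
        hooks.append(module.register_forward_hook(save_attention))

model.eval()
with torch.no_grad():
    _ = model(input_image)  # one forward pass fills attention_maps

for h in hooks:
    h.remove()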
While running the RNN tutorial, the following error appears after the "reading data line" statements:
reading data line 22500000
W tensorflow/core/common_runtime/executor.cc:1052] 0x3ef81ae60 Compute status: Not found: ./checkpoints_directory/translate.ckpt-200.tempstate15092134273276121938
[[Node: save/save = SaveSlices[T=[DT_FLOAT, DT_INT32, DT_FLOAT, DT_FLOAT
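A minimal sketch of what is often the cause (an assumption on my part, not a confirmed diagnosis): the SaveSlices "Not found" error typically appears when the directory the saver writes its temp-state file into does not exist yet, so it helps to create the training directory before calling saver.save. The path below is taken from the error message above.

import os

ckpt_dir = "./checkpoints_directory"
if not os.path.exists(ckpt_dir):
    os.makedirs(ckpt_dir)  # make sure the directory exists before saver.save(...)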
I am using the NeuroSky MindWave headset, so I downloaded the NeuroPy library to get readings from it, and I tried the example code:
from NeuroPy.NeuroPy import NeuroPy
from time import sleep
neuropy = NeuroPy()
def attention_callback(attention_value):
"""this function will be called everytime NeuroPy has a new value for attention"""
I am hoping to do something like this:
from transformers import BertTokenizer, BertModel
import torch

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')
model = BertModel.from_pretrained('bert-base-uncased')
input_ids = torch.tensor(tokenizer.encode("Hello, my dog is cute")).unsqueeze(0) # Batch size 1
outputs = model(input_ids)
last_hidden_states = outputs[0]  # the last hidden state is the first element of the output tuple
I am running custom code to train my own Seq2Seq model in TensorFlow. I am using MultiRNNCell and embedding_attention_seq2seq. When restoring the model, I get the following error:
2017-07-14 13:49:13.693612: W tensorflow/core/framework/op_kernel.cc:1158] Not found: Key embedding_attention_seq2seq/rnn/embedding_wrapper/multi_rnn_cell/cell_1/basic_lstm_cell/kernel not found in ch
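A minimal diagnostic sketch (TF 1.x assumed; "path/to/translate.ckpt" is a hypothetical checkpoint path): list the variable names actually stored in the checkpoint so they can be compared with the "Key ... not found" name in the error, which usually reveals a scope-name mismatch between the saved and the restored graph.

import tensorflow as tf

reader = tf.train.NewCheckpointReader("path/to/translate.ckpt")
for name, shape in sorted(reader.get_variable_to_shape_map().items()):
    print(name, shape)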
I am working on a language translation model.
1. I want to visualize the data as described in [http://www.wildml.com/2016/01/attention-and-memory-in-deep-learning-and-nlp/](http://www.wildml.com/2016/01/attention-and-memory-in-deep-learning-and-nlp/) using the BLEU score.
2.
for a in xrange(num_heads):
with variable_scope.variabl
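A minimal sketch of an attention visualization in the spirit of the WildML post above (my own sketch, not from the question): attention_weights, source_tokens and target_tokens are hypothetical placeholders for values produced by your model.

import numpy as np
import matplotlib.pyplot as plt

def plot_attention(attention_weights, source_tokens, target_tokens):
    """attention_weights: array of shape (len(target_tokens), len(source_tokens))."""
    fig, ax = plt.subplots()
    ax.imshow(attention_weights, cmap="viridis")
    ax.set_xticks(np.arange(len(source_tokens)))
    ax.set_xticklabels(source_tokens, rotation=90)
    ax.set_yticks(np.arange(len(target_tokens)))
    ax.set_yticklabels(target_tokens)
    ax.set_xlabel("source")
    ax.set_ylabel("target")
    plt.tight_layout()
    plt.show()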
In a Transformer unit there are modules called query, key and value, or simply Q, K, V.
Based on BERT and (in particular in ), my understanding of the forward pass through the attention module of a single attention head (using Q, K, V), in pseudocode, is as follows:
q_param = a matrix of learned parameters
k_param = a matrix of learned parameters
v_param = a matrix of learned parameters
d = one of the matrix dimensions (scalar value)
def attention(to_tensor, from_tensor, atten
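To make the pseudocode concrete, here is a minimal NumPy sketch of a single attention head's forward pass. The names q_param, k_param, v_param and d follow the pseudocode above; the softmax helper and the scaling by sqrt(d) are the standard scaled dot-product formulation and are my additions, not taken from the question.

import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(from_tensor, to_tensor, q_param, k_param, v_param, d):
    # from_tensor: (from_len, d)  queries are computed from this sequence
    # to_tensor:   (to_len, d)    keys and values are computed from this sequence
    q = from_tensor @ q_param
    k = to_tensor @ k_param
    v = to_tensor @ v_param
    scores = q @ k.T / np.sqrt(d)        # (from_len, to_len) scaled dot products
    weights = softmax(scores, axis=-1)   # attention distribution over "to" positions
    return weights @ v                   # weighted sum of value vectors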
Hi, I have a heavily imbalanced dataset and I am trying to incorporate SMOTE before using BERT. However, I am a bit confused about where this should happen when I split my data into train, validation and test sets. Since I am using text data, does it happen after tokenization?
Code snippet:
def tokenize(df):
    input_ids = []
    attention_masks = []
    for i, text in enumerate(df["tidy_tweet"]):
        tokens = tokenizer.encode_plus(text, max_length=SEQ_LEN,
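A minimal sketch of one common ordering (my suggestion, not from the question): split first, tokenize afterwards, and oversample only the training matrix, so synthetic rows never leak into validation or test data. tokenizer and SEQ_LEN are the same as in the snippet; texts and labels are hypothetical placeholders. Note that SMOTE interpolates numeric feature vectors, so applying it to raw token IDs is a rough approximation.

import numpy as np
from sklearn.model_selection import train_test_split
from imblearn.over_sampling import SMOTE

train_texts, test_texts, train_labels, test_labels = train_test_split(
    texts, labels, test_size=0.2, stratify=labels, random_state=42)

# tokenize only after splitting
train_enc = [tokenizer.encode_plus(t, max_length=SEQ_LEN, padding="max_length",
                                   truncation=True)["input_ids"] for t in train_texts]
X_train = np.array(train_enc)  # (n_samples, SEQ_LEN)

# oversample only the training split
X_train_res, y_train_res = SMOTE(random_state=42).fit_resample(X_train, train_labels)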
I am trying to convert a PyTorch Lightning model (a .ckpt checkpoint) to ONNX with the following code:
test_comment = 'I am still waiting on my card?'
start = time.time()
encoding = tokenizer.encode_plus(
    test_comment,
    add_special_tokens=True,
    max_length=512,
    return_token_type_ids=False,
    padding="max_length",
    return_attention_mask=True,
    return_tensors="pt",
)
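A minimal sketch of the export step itself (my own sketch, not the author's code; MyLightningModule, "model.ckpt" and "model.onnx" are hypothetical names, and the input/output names and dynamic axes are assumptions):

import torch

model = MyLightningModule.load_from_checkpoint("model.ckpt")
model.eval()

torch.onnx.export(
    model,
    (encoding["input_ids"], encoding["attention_mask"]),  # example inputs for tracing
    "model.onnx",
    input_names=["input_ids", "attention_mask"],
    output_names=["output"],
    dynamic_axes={"input_ids": {0: "batch"}, "attention_mask": {0: "batch"}},
)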
Hi all,
I am using a bucketing-like technique for a seq2seq task:
# For different length in encoder and decoder
model_map = {}
for i in encoder_shape:
    for j in decoder_shape:
        with variable_scope.variable_scope(variable_scope.get_variable_scope(),
                                           reuse=True if tt > 0 else None):
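A minimal sketch of the reuse pattern, under the assumption that tt is meant to be a counter over bucket combinations (build_model is a hypothetical model-building function): only the very first (i, j) pair creates the variables, every later pair reuses them. TF 1.x style, matching the code above.

import tensorflow as tf

model_map = {}
tt = 0
for i in encoder_shape:
    for j in decoder_shape:
        with tf.variable_scope(tf.get_variable_scope(),
                               reuse=True if tt > 0 else None):
            model_map[(i, j)] = build_model(i, j)  # hypothetical builder
        tt += 1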
This is part of my code.
from transformers import BertTokenizer,BertForSequenceClassification,AdamW
tokenizer = BertTokenizer.from_pretrained('bert-base-uncased',do_lower_case = True,truncation=True)
input_ids = []
attention_mask = []
for i in text:
    encoded_data = tokenizer.encode_plus(
        i,
        ad
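The snippet above is cut off; here is a minimal sketch of how such a loop is usually completed (the max_length value and the padding/truncation choices are my assumptions, not taken from the snippet):

import torch

for i in text:
    encoded_data = tokenizer.encode_plus(
        i,
        add_special_tokens=True,
        max_length=128,
        padding="max_length",
        truncation=True,
        return_attention_mask=True,
        return_tensors="pt",
    )
    input_ids.append(encoded_data["input_ids"])
    attention_mask.append(encoded_data["attention_mask"])

input_ids = torch.cat(input_ids, dim=0)
attention_mask = torch.cat(attention_mask, dim=0)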