Seq2Seq-Based Dialogue Generation

Date: 2025-12-26


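This article is about building a dialogue model on Seq2Seq. It first introduces where Seq2Seq is applied (machine translation, text generation, sentiment analysis, and so on) and the encoder-decoder structure, then walks through a hands-on implementation: installing the libraries, preprocessing the data (tokenization, building the dictionary, and so on), building the Encoder and Decoder, and training and testing the model. Because the dataset is small the results are limited, but the final model can hold a simple conversation.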


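Seq2Seq is a model that converts one sequence into another sequence. Typical applications of Seq2Seq:

- Translation: text in a source language is automatically converted into the target language while the structure of the original text is preserved.
- Text generation: for example dialogue, where the model generates a piece of text in response to its input.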

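Introduction to Seq2Seq

Background

Research on chatbots can be traced back to the 1950s, when Alan Turing proposed the Turing test to answer the question "Can machines think?", which set off a wave of artificial intelligence research. Natural language processing models have developed rapidly since then. Even so, we still need to learn traditional NLP if we want to go further in the future. This article introduces a common end-to-end learning method, Seq2Seq: it uses a multi-layer long short-term memory network (LSTM) to map the input sequence to a vector of fixed dimensionality, and then uses another LSTM to decode that vector into the target sequence.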

Network structure


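In a Seq2Seq model, one sequence is turned into another sequence by a specific method, and the input and output sequences may have different lengths. This structure is also called the encoder-decoder model. It is a variant of the RNN, designed to remove the RNN's requirement that the input and output sequences have equal length.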

Encoder

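The encoder usually uses a bidirectional recurrent neural network (Bi-Directional Recurrent Neural Network, Bi-RNN), in which the RNN passes over the sequence in both the forward and the backward direction. It converts a variable-length input sequence into a context of fixed shape and encodes the information of the input sequence into it, so the text information can be used from more than one direction.

In the forward method, we pass in the source sentence and use the embedding layer to convert it into dense vectors, apply dropout, and then pass these vectors to the RNN. When the whole sequence is passed to the RNN, it computes the hidden states over the sequence recursively for us. Note that we do not pass an initial hidden state or cell state to the RNN: when none is supplied, Paddle automatically creates an all-zero tensor as the initial state. A code sketch:

```python
# Forward pass (sketch): embedding_layer, dropout_layer and rnn_network
# stand for the layers of the model.
def forward(x):
    # Convert the source tokens into dense vectors
    embedded = embedding_layer.forward(x)
    # Apply dropout
    drop_out = dropout_layer.forward(embedded)
    # Pass the vectors to the RNN; no initial state is given
    outputs, hidden_states = rnn_network(drop_out)
    return outputs, hidden_states
```

This shows how the input is converted into dense vectors and how the hidden states are computed automatically, without initializing them by hand.

In the implementation used here, the embedding layer and the LSTM are the main components. A dropout layer is applied to x to prevent overfitting, and the computation is defined in a custom `forward` method. The hidden state (h) and the cell state (c) are the key carriers of information.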
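To make the idea of folding a variable-length input into a fixed-size state concrete, here is a toy recurrent encoder in plain Python. It is only an illustration under simplifying assumptions: `toy_rnn_encode`, its scalar weights `w_in` and `w_rec`, and `hidden_size` are made up for this sketch, and a single tanh recurrence stands in for the LSTM. The state starts at all zeros, mirroring the default initial state described above.

```python
import math

def toy_rnn_encode(sequence, hidden_size=3, w_in=0.5, w_rec=0.8):
    """Fold a list of numbers of any length into `hidden_size` floats."""
    # All-zero initial state, as when no initial state is passed in.
    h = [0.0] * hidden_size
    for x in sequence:
        # Each unit mixes the current input with its own previous state.
        h = [math.tanh(w_in * (k + 1) * x + w_rec * h[k])
             for k in range(hidden_size)]
    return h

short_state = toy_rnn_encode([1.0, 2.0])
long_state = toy_rnn_encode([1.0, 2.0, 3.0, 4.0, 5.0])
# Both inputs end up as a state of the same size.
print(len(short_state), len(long_state))
```

Whatever the input length, the result has `hidden_size` entries, which is the property the decoder relies on.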

Decoder

The decoder's job is to produce the output text.

That is all for the theory. Next, I will use a simple example to implement a Seq2Seq model.

Practice

Installing the libraries

```
!pip install jieba
!pip install --upgrade pip
```

```
Looking in indexes: https://mirror.baidu.com/pypi/simple/, https://mirrors.aliyun.com/pypi/simple/
Requirement already satisfied: jieba in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (0.42.1)
[notice] A new release of pip available: 22.1.2 -> 24.0
[notice] To update, run: pip install --upgrade pip
Looking in indexes: https://mirror.baidu.com/pypi/simple/, https://mirrors.aliyun.com/pypi/simple/
Requirement already satisfied: pip in /opt/conda/envs/python35-paddle120-env/lib/python3.7/site-packages (22.1.2)
Collecting pip
  Downloading https://mirrors.aliyun.com/pypi/packages/8a/6a/19e9fe04fca059ccf770861c7d5721ab4c2aebc539889e97c7977528a53b/pip-24.0-py3-none-any.whl (2.1 MB)
     2.1/2.1 MB 316.0 kB/s eta 0:00:00
Installing collected packages: pip
  Attempting uninstall: pip
    Found existing installation: pip 22.1.2
    Uninstalling pip-22.1.2:
      Successfully uninstalled pip-22.1.2
Successfully installed pip-24.0
```

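Data preprocessing

The data used here is a dialogue dataset.

First, the raw data needs to be cleaned: invalid characters are removed, each sentence is split into words, and the words are then converted into a format the model can consume. This step is one of the foundations for building the model.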

```python
import jieba
import numpy as np

# Remove invalid characters
with open("datasets/one.txt", "r", encoding="utf-8") as f:
# with open("data/data86810/human_chat.txt", "r", encoding="utf-8") as f:
    data = f.read()
# Speaker tags and punctuation are each replaced by a space
for junk in ("Human 1", "Human 2") + tuple(".*@^&!#$?;:,\"%/()'"):
    data = data.replace(junk, " ")
data = data.lower().split("\n")
# print(len(data))

lst = []
# Split each line into words
for obj in data:
    # sen = list(obj.split(" "))
    sen = list(jieba.cut(obj, cut_all=False))
    lst.append(sen)
```

```
Building prefix dict from the default dictionary ...
Dumping model to file cache /tmp/jieba.cache
Loading model cost 1.100 seconds.
Prefix dict has been built successfully.
```

```python
# Tokenization result
lst
```

(Output: a nested list of tokens, one inner list per line of the dataset.)

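The next step builds the token dictionary. A list comprehension joins the tokens of each sentence with spaces, and the result is appended to one running string. That string is later split again, which discards any stray whitespace, and duplicates are removed so that each token appears only once. Finally the words are counted, the dictionary is built, and its length is printed as the final result.

The dataset is small. After tokenizing with jieba we build a dictionary whose keys are tokens and whose values are their assigned positions (indices) in the dictionary; a sentence is then represented by the indices of its tokens.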
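The code for this step is not shown here, so the following is only a minimal sketch of one way to build the two lookups that the later cells rely on: `dic` (token to index) and `words` (index to token), including the special tokens "sos", "eos" and "pad". The function name `build_vocab` and the `toy_` variables are made up for this illustration, and the ordering is an assumption: this sketch keeps first-seen order, whereas a `set`-based version would assign arbitrary indices.

```python
def build_vocab(sentences, specials=("sos", "eos", "pad")):
    """Return (words, dic): an index -> token list and a token -> index dict."""
    seen = {}
    for sen in sentences:
        for word in sen:
            # Skip empty strings and bare spaces, as the encoding step does.
            if word not in ("", " "):
                seen.setdefault(word, None)
    # Append the special tokens unless the data already contains them.
    words = list(seen) + [s for s in specials if s not in seen]
    dic = {word: i for i, word in enumerate(words)}
    return words, dic

toy_lst = [["你", "好"], ["你", "好", "呀"], ["再", "见", " "]]
toy_words, toy_dic = build_vocab(toy_lst)
print(len(toy_dic))  # 5 distinct tokens + 3 special tokens = 8
```

Keeping both directions around is what lets the later cells map sentences to indices for training and map predictions back to text.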

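```python
dic
```

`sen_len` is the fixed length of every sentence, that is, the size of its last dimension.

```python
# Stores the dialogue pairs
index_data = []
# Length of every sentence: short ones are padded with "pad", long ones are cut
sen_len = 10
for i, sen in enumerate(lst):
    # Map tokens to indices, skipping empty strings
    sen = [dic[word] for word in sen if word != '' and word != ' ']
    # Put "sos" at the start
    sen.insert(0, dic["sos"])
    while len(sen) < sen_len - 1:
        # Pad with "pad" until the sentence is long enough
        sen.append(dic["pad"])
    # Keep the first sen_len-1 tokens
    sen = sen[:sen_len - 1]
    # Put "eos" at the end
    sen.append(dic["eos"])
    # Pair each ask (even line) with its answer (odd line)
    if i % 2 == 0:
        one = []
        one.append(sen)
    else:
        one.append(sen)
        index_data.append(one)
# print(len(index_data))
index_data = np.array(index_data)
print(index_data.shape)
print(index_data[0])
```

```
(54, 2, 10)
[[ 10 205 219 219 219 219 219 219 219 131]
 [ 10  13 219 219 219 219 219 219 219 131]]
```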
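The layout of each row can be checked in isolation. The sketch below is a standalone restatement of the padding rule, assuming the indices that can be read off the printed row (10 for "sos", 219 for "pad", 131 for "eos") and a sentence length of 10; the helper name `to_fixed_length` is made up for this illustration.

```python
def to_fixed_length(ids, sos, eos, pad, sen_len):
    """Return [sos, ids..., pad..., eos] with exactly sen_len entries."""
    row = [sos] + list(ids)
    # Leave room for the final eos, cutting long sentences.
    row = row[:sen_len - 1]
    # Pad short sentences up to sen_len - 1 entries.
    row += [pad] * (sen_len - 1 - len(row))
    row.append(eos)
    return row

short_row = to_fixed_length([205], sos=10, eos=131, pad=219, sen_len=10)
long_row = to_fixed_length(range(20), sos=10, eos=131, pad=219, sen_len=10)
print(short_row)  # [10, 205, 219, 219, 219, 219, 219, 219, 219, 131]
print(long_row)   # [10, 0, 1, 2, 3, 4, 5, 6, 7, 131]
```

The first result matches the first row of the printed output: one real token, seven pads, and the start and end markers.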

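When processing the data, each token is converted into an index. Later on, ensembling could be used to improve prediction accuracy, at the cost of making the example more complex. For instance, an ensembled model could be used for prediction; to keep things simple, this example does not use one.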

```python
# Check the result
ask, ans = index_data[3]
# Convert the index sequences back into tokens
ask_str = [words[i] for i in ask]
ans_str = [words[i] for i in ans]
print(ask_str)
print(ans_str)
# print(dic)
```

```
['sos', ..., 'eos']
['sos', ..., 'eos']
```

DataLoader

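To keep training efficient, we create a DataLoader that moves the data from host memory to device memory in batches, so that it can be fed to the model quickly.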

```python
import random

import paddle
import paddle.nn as nn
from paddle.io import Dataset, DataLoader

# Batch size
batch_size = 128


class Mydataset(Dataset):
    def __init__(self, index_data, dic):
        super(Mydataset, self).__init__()
        self.index_data = index_data
        self.dic = dic

    def __getitem__(self, index):
        ask_data, ans_data = self.index_data[index]
        # The ask sequence is reversed; the answer is left as it is
        ask_data, ans_data = ask_data[:][::-1], ans_data
        return ask_data, ans_data

    def __len__(self):
        return self.index_data.shape[0]


# Instantiate the dataset
dataset = Mydataset(index_data, dic)
# Wrap it in a DataLoader. With drop_last=True an incomplete final batch is
# discarded, so the dataset needs at least batch_size samples to yield a batch.
dataloader = DataLoader(dataset, batch_size=batch_size, shuffle=True, drop_last=True)
# Check the result
for _, __ in dataloader():
    print(_, __)
    # break
```
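The interaction between `batch_size` and `drop_last` is worth checking by hand, because it decides how many batches an epoch contains. The sketch below is plain arithmetic rather than Paddle code; the helper name `count_batches` is made up for this illustration, and 54 is the number of pairs reported by `index_data.shape` earlier.

```python
def count_batches(num_samples, batch_size, drop_last):
    """Number of batches one pass over the data produces."""
    full, remainder = divmod(num_samples, batch_size)
    # The incomplete last batch only counts when it is not dropped.
    if remainder and not drop_last:
        full += 1
    return full

print(count_batches(54, 128, drop_last=True))   # 0
print(count_batches(54, 128, drop_last=False))  # 1
print(count_batches(54, 16, drop_last=True))    # 3
```

With only 54 pairs, a batch size of 128 and `drop_last=True` leave nothing to iterate over, so a smaller batch size or `drop_last=False` is needed on a dataset of that size.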

With the dataset built, it is time to build the network.

Building the Encoder

The encoder is built with Paddle's high-level API `nn.LSTM`.

```python
class Encoder(nn.Layer):
    def __init__(self, vocab_size, emb_dim, hid_dim, drop_out, n_layers):
        super(Encoder, self).__init__()
        self.hid_dim = hid_dim
        self.n_layers = n_layers
        # Embedding output: [batch_size, sen_len, emb_dim]
        self.emb = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hid_dim, n_layers)
        self.drop = nn.Dropout(drop_out)

    def forward(self, x):
        # x: [batch_size, sen_len, emb_dim]
        x = self.drop(self.emb(x))
        # y: [batch_size, sen_len, hid_dim * n_directions]
        y, (h, c) = self.lstm(x)
        return h, c


vocab_size = len(dic)
emb_dim = 128
hid_dim = 256
drop_out = 0.7
n_layers = 2
# Instantiate the encoder
encoder = Encoder(vocab_size, emb_dim, hid_dim, drop_out, n_layers)
x = paddle.randint(0, 136, [batch_size, sen_len])
h, c = encoder(x)
# Check the shapes
print(h.shape, c.shape)
```

```
[2, 128, 256] [2, 128, 256]
```

Building the Decoder

```python
class Decoder(nn.Layer):
    def __init__(self, vocab_size, emb_dim, hid_dim, drop_out, n_layers):
        super(Decoder, self).__init__()
        self.vocab_size = vocab_size
        self.emb_dim = emb_dim
        self.hid_dim = hid_dim
        self.emb = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hid_dim, n_layers)
        self.drop = nn.Dropout(drop_out)
        self.fc = nn.Linear(hid_dim, vocab_size)

    def forward(self, x, hidden, cell):
        # x = [batch_size]
        # hidden = [n_layers * n_directions, batch_size, hid_dim]
        # cell = [n_layers * n_directions, batch_size, hid_dim]
        # Add a sequence dimension: x = [batch_size, 1]
        x = paddle.unsqueeze(x, axis=1)
        # x = [batch_size, 1, emb_dim]
        x = self.drop(self.emb(x))
        # output = [batch_size, 1, hid_dim * n_directions]
        # h, c = [n_layers * n_directions, batch_size, hid_dim]
        output, (h, c) = self.lstm(x, (hidden, cell))
        # prediction = [batch_size, vocab_size]
        prediction = self.fc(output.squeeze(1))
        return prediction, h, c


decoder = Decoder(vocab_size, emb_dim, hid_dim, drop_out, n_layers)
x = paddle.randint(0, 136, [batch_size])
y, h, c = decoder(x, h, c)
print(y.shape)
```

```
[128, 241]
```

Combining the Encoder and Decoder

```python
import random


class seq2seq(nn.Layer):
    def __init__(self, encoder, decoder):
        super(seq2seq, self).__init__()
        nn.initializer.set_global_initializer(nn.initializer.XavierNormal(), nn.initializer.Constant(0.))
        self.encoder = encoder
        self.decoder = decoder

    def forward(self, source, target, teacher_forcing_ratio=0.5):
        # source = [batch_size, src_len]
        # target = [batch_size, trg_len]
        # teacher_forcing_ratio is the probability of using teacher forcing
        target_len = target.shape[1]
        batch_size = target.shape[0]
        # outputs = [tar_len, batch_size, vocab_size]
        outputs = paddle.zeros([target_len, batch_size, self.decoder.vocab_size])
        hidden, cell = self.encoder(source)
        # The first input x is "sos"
        x = target[:, 0]
        # Loop (tar_len - 1) times
        for t in range(1, target_len):
            output, hidden, cell = self.decoder(x, hidden, cell)
            # Store the scores for this step
            outputs[t] = output
            # Decide whether to use teacher forcing
            flag = random.random() < teacher_forcing_ratio
            # Most likely token
            top1 = paddle.argmax(output, axis=1)
            # The next input is the true token or the predicted one
            x = target[:, t] if flag else top1
        return outputs


x = paddle.randint(0, 136, [20, sen_len])
y = paddle.randint(0, 136, [20, sen_len])
model = seq2seq(encoder, decoder)
predict = model(x, y)
print(predict.shape)
```

```
[10, 20, 241]
```
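The effect of `teacher_forcing_ratio` is easier to see without the network. The sketch below records which token is fed to the decoder at each step; `decode_inputs` and the stand-in `always_wrong` predictor are made up for this illustration and replace the real decoder and argmax.

```python
import random

def decode_inputs(target, predict_next, teacher_forcing_ratio, rng):
    """Return the tokens fed to the decoder at each step."""
    fed = []
    x = target[0]  # the first input is always "sos"
    for t in range(1, len(target)):
        fed.append(x)
        guess = predict_next(x)  # stand-in for decoder + argmax
        use_truth = rng.random() < teacher_forcing_ratio
        # Next input: the true token with teacher forcing, else the guess.
        x = target[t] if use_truth else guess
    return fed

def always_wrong(token):
    return "?"

target = ["sos", "a", "b", "c", "eos"]
print(decode_inputs(target, always_wrong, 1.0, random.Random(0)))
# ['sos', 'a', 'b', 'c']
print(decode_inputs(target, always_wrong, 0.0, random.Random(0)))
# ['sos', '?', '?', '?']
```

A ratio of 1.0 always feeds the ground truth, a ratio of 0.0 always feeds the model's own guess, and values in between mix the two at random per step.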

Viewing the network structure

```python
# Disable gradient tracking
@paddle.no_grad()
def init_weights(m):
    for name, param in m.named_parameters():
        # Uniform initialization
        param.data = paddle.uniform(min=-0.2, max=0.2, shape=param.shape)

# Initialize the model
model.apply(init_weights)
```

```
seq2seq(
  (encoder): Encoder(
    (emb): Embedding(241, 128, sparse=False)
    (lstm): LSTM(128, 256, num_layers=2
      (0): RNN(
        (cell): LSTMCell(128, 256)
      )
      (1): RNN(
        (cell): LSTMCell(256, 256)
      )
    )
    (drop): Dropout(p=0.7, axis=None, mode=upscale_in_train)
  )
  (decoder): Decoder(
    (emb): Embedding(241, 128, sparse=False)
    (lstm): LSTM(128, 256, num_layers=2
      (0): RNN(
        (cell): LSTMCell(128, 256)
      )
      (1): RNN(
        (cell): LSTMCell(256, 256)
      )
    )
    (drop): Dropout(p=0.7, axis=None, mode=upscale_in_train)
    (fc): Linear(in_features=256, out_features=241, dtype=float32)
  )
)
```

```python
def check(str_lst):
    index_set = set(str_lst)
    # Distinct indices
    lst = list(index_set)
    # One occurrence counter per distinct index
    zeros = [0 for index in lst]
    # Turn them into a dictionary
    index_dic = dict(zip(lst, zeros))
    index_list = []
    # Find the positions of repeated indices
    for i in range(len(str_lst)):
        index = str_lst[i]
        if index in index_set:
            index_dic[index] += 1
            if index_dic[index] > 1:
                index_list.append(i)
    # Delete the repeats
    str_lst = np.delete(str_lst, index_list)
    str_lst = paddle.to_tensor(str_lst, dtype="int64")
    return str_lst


arr = np.array([1, 2, 3, 4, 1, 1, 2, 2])
print(check(arr))
```

```
Tensor(shape=[4], dtype=int64, place=CPUPlace, stop_gradient=True,
       [1, 2, 3, 4])
```
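What `check` computes is "keep the first occurrence of each index, in order". As a cross-check, the same result can be obtained in plain Python; the helper name `drop_repeats` is made up for this sketch, which also shows the repeat count that the training loop later derives from the difference in lengths.

```python
def drop_repeats(ids):
    """Keep the first occurrence of each id, preserving order."""
    # dict keys are unique and remember insertion order.
    return list(dict.fromkeys(ids))

ids = [1, 2, 3, 4, 1, 1, 2, 2]
unique_ids = drop_repeats(ids)
repeat_count = len(ids) - len(unique_ids)
print(unique_ids)    # [1, 2, 3, 4]
print(repeat_count)  # 4
```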

A helper function for testing

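```python
# Evaluation function
def evaluate(model, ask_sen=ask):
    # Add a batch dimension: [1, sen_len]
    ask_sen = paddle.to_tensor(ask_sen).unsqueeze(axis=0)
    tar = paddle.zeros([1, sen_len])
    # The first token is "sos"
    tar[0, 0] = dic["sos"]
    tar = tar.astype("int64")
    # teacher_forcing_ratio=0: the model only sees its own predictions
    ans = model(ask_sen, tar, 0)
    # Squeeze out the batch_size dimension
    ans = ans.squeeze(axis=1)
    ans = ans.argmax(axis=1)
    ans_str = [words[i] for i in ans.numpy()]
    string = " ".join(ans_str)
    return string

print(evaluate(model, ask))
```

(Output: a short string of tokens from the model.)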

Model training

With the data and the network structure in place, training can begin.

```python
learning_rate = 2e-4
epoch_num = 1000
# Gradient clipping, to keep the LSTM gradients from exploding
clip_grad = nn.ClipGradByNorm(1)
# Loss that ignores the "pad" token
loss = nn.CrossEntropyLoss(ignore_index=dic["pad"])
# Instantiate the optimizer
optimize = paddle.optimizer.Momentum(learning_rate, parameters=model.parameters(), grad_clip=clip_grad)

model.train()
for epoch in range(epoch_num):
    for i, (user_data, assist_data) in enumerate(dataloader()):
        # Clear the gradients
        optimize.clear_grad()
        # Get the predictions (teacher_forcing_ratio=0)
        predict = model(user_data, assist_data, 0)
        # predict = [0, y_hat1, y_hat2, ...], assist_data = ["sos", y1, y2, ...]:
        # drop the first step of both and flatten them.
        # predict is time-major, so the targets are transposed to match.
        predict = paddle.reshape(predict[1:], [-1, vocab_size])
        assist_data = paddle.transpose(assist_data[:, 1:], [1, 0])
        assist_data = paddle.reshape(assist_data, [-1]).astype("int64")
        # Count how many predicted tokens are repeats
        str_predict = predict.argmax(axis=1)
        str_del = check(str_predict.numpy())
        # print("predict:", str_predict)
        # print("del:", str_del)
        num = str_predict.shape[0] - str_del.shape[0]
        # Compute the loss (predict is already a float32 tensor)
        avg_loss = loss(predict, assist_data)
        # print(avg_loss.numpy(), num)
        avg_loss += num
        # Backpropagate
        avg_loss.backward()
        # Optimizer step
        optimize.minimize(avg_loss)
        # Clear the gradients
        optimize.clear_grad()
        if i % 10 == 0:
            print("epoch:%d,i:%d,loss:%f" % (epoch, i, avg_loss.numpy()))
            print(evaluate(model, index_data[random.randint(0, len(index_data) - 1)][0]))
    if epoch % 10 == 0:
        # Save the model parameters
        paddle.save(model.state_dict(), "work/zh/seq2seq_1.pdparams")
```
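The role of `ignore_index` in the loss can be illustrated without Paddle. The sketch below is a plain-Python cross-entropy that skips positions whose target is the padding index and, as an assumption of this sketch, averages over the remaining positions; `masked_cross_entropy` and the toy numbers are made up for the illustration and are not Paddle's implementation.

```python
import math

def masked_cross_entropy(logits, targets, ignore_index):
    """Mean negative log-likelihood over targets that are not ignore_index."""
    total, count = 0.0, 0
    for row, target in zip(logits, targets):
        if target == ignore_index:
            continue  # padded positions contribute nothing
        m = max(row)  # subtract the max for numerical stability
        log_sum = m + math.log(sum(math.exp(v - m) for v in row))
        total += log_sum - row[target]
        count += 1
    return total / count if count else 0.0

# Vocabulary of 3 tokens, index 2 plays the role of "pad".
toy_logits = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
toy_targets = [1, 2]
toy_loss = masked_cross_entropy(toy_logits, toy_targets, ignore_index=2)
print(round(toy_loss, 4))  # 1.0986, i.e. log(3) from the single real target
```

One related observation about the loop above: `num` is an ordinary number computed from `argmax`, so adding it to `avg_loss` shifts the reported loss value but, being a constant with respect to the parameters, does not change the gradients.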

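Prediction

Because the dataset is small, the model trains quickly but the results are mediocre. We can now load the trained model and try a conversation with it.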

```python
encoder = Encoder(vocab_size, emb_dim, hid_dim, drop_out, n_layers)
decoder = Decoder(vocab_size, emb_dim, hid_dim, drop_out, n_layers)
model = seq2seq(encoder, decoder)
state_dict = paddle.load("work/zh/seq2seq.pdparams")
model.load_dict(state_dict)
```

A function that turns a sequence of indices back into a sentence:

```python
def transform(index_tensor):
    string = [words[i] for i in index_tensor]
    return " ".join(string)
```

Testing

```python
print("human 1:", transform(index_data[10][0]))
print("human 2", evaluate(model, index_data[10][1]))
```

```
human 1: sos ... pad pad pad pad pad eos
human 2 ...
```

```python
transform(index_data[10][0])
```

```
'sos ... pad pad pad pad pad eos'
```
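The decoded strings still contain the markers "sos", "pad" and "eos". If they should be hidden when showing a reply, a small filter is enough. This is only a sketch: the helper name `strip_special` and the sample tokens are made up for the illustration.

```python
def strip_special(tokens, specials=("sos", "eos", "pad")):
    """Drop the special marker tokens and keep everything else in order."""
    return [t for t in tokens if t not in specials]

sample = ["sos", "hello", "there", "pad", "pad", "eos"]
cleaned = " ".join(strip_special(sample))
print(cleaned)  # hello there
```

It could be applied to the token list inside `transform` or `evaluate` before joining.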
