Commit bdf7bb1

Add files via upload
1 parent 9b8f101, commit bdf7bb1

7 files changed, +24,237 -17 lines changed

EnglishTest(30%).txt

+6,031
Large diffs are not rendered by default.

TagalogTest(30%).txt

+6,031
Large diffs are not rendered by default.

TagalogtoEnglish.py

+6-6
@@ -30,22 +30,22 @@ def sentence_tokenizer(sentence_list) :
 def translate() :
     tokenizer = nltk.data.load('tokenizers/punkt/english.pickle')
     f_Output = ""
-    with open("input.txt") as f:
+    with open("input.txt", encoding="utf-8") as f:
         tagalog_data = f.readlines()

     tagalog_lines = sentence_tokenizer(tagalog_data)

     tagalog_sentences = list()
-    for line in tagalog_lines :
+    for line in tagalog_lines:
         l = tokenizer.tokenize(line)
-        for sen in l :
+        for sen in l:
             tagalog_sentences.append(sen)

-    otp_file = open("output.txt", "w+")
-    for index in range(len(tagalog_sentences)) :
+    otp_file = open("output.txt", "w+", encoding="utf-8")
+    for index in range(len(tagalog_sentences)):
         current_sentence = tagalog_sentences[index]
         translated_sentence = ModelTester.sentence_tester1(current_sentence, 1)
         otp_file.write(translated_sentence)
-        otp_file.write(". ")
+        otp_file.write(". \n")

     print("Successfully translated! Translated document is 'output.txt' ")
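
Note on the hunk above: the visible lines open output.txt with an explicit encoding but never show it being closed. A minimal sketch of one way to write the same one-sentence-per-line output inside a with-block follows. The helper name write_translations and its translate_sentence parameter are hypothetical and stand in for the call to ModelTester.sentence_tester1, whose behavior is not shown in this commit.

```python
from typing import Callable, Iterable


def write_translations(
    sentences: Iterable[str],
    translate_sentence: Callable[[str], str],  # hypothetical stand-in for the model call
    output_path: str = "output.txt",
) -> int:
    """Write one translated sentence per line and return how many were written."""
    count = 0
    # The with-block closes and flushes the file even if a translation raises.
    with open(output_path, "w", encoding="utf-8") as otp_file:
        for current_sentence in sentences:
            translated_sentence = translate_sentence(current_sentence)
            # Same ". \n" terminator as the updated write call in the diff.
            otp_file.write(translated_sentence + ". \n")
            count += 1
    return count
```

Passing the translation function as an argument keeps the file handling testable without loading the trained model.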

changes.txt

+41
@@ -0,0 +1,41 @@
+Changes:
+
+File: (DutchtoEnglish.py)
+
+DutchtoEnglish.py ->TagalogtoEnglish.py
+
+final_list -> f_list
+index -> index_list
+output_sentence -> otp_sentence
+final_output -> f_output
+current_sen -> current_sentence
+curr_translated_sen -> translated_sentence
+out_file -> otp_file
+
+File: (EnglishtoTagalog.py)
+
+EnglishtoDutch.py ->EnglishtoTagalog.py
+final_list -> f_list
+index -> index_list
+l -> curr_line
+current_sen -> current_sentence
+curr_translated_sen -> translated_sentence
+out_file -> otp_file
+
+File: (utlis.py)
+
+nCr -> n_factorial_r
+f -> factorial
+
+
+
+
+
+
+
+
+
+
+
+
+

input.txt

+6,031-10
Large diffs are not rendered by default.

output.txt

+6,082-1
Large diffs are not rendered by default.

tasks.txt

+15
@@ -0,0 +1,15 @@
+Program:
+• Test other data sets for training model
+• Create an algorithm that will get the BLEU score (Cedric, Miguel, Hakob)
+• Change the variable, method, and file names of the programs (Note: I have already changed main, model_trainer, ibm1_em, and utils, so perhaps just work on the ones that have not been changed yet) (Cedric)
+update (Cedric has already changed tagalogtoenglish and englishtotagalog) ✔
+• Disseminate the train, test, and validation of data
+• Document the results
+• Try to improve the accuracy
+• Edit the input and output text. Make sure the system will work
+
+Document:
+• Chapter 1 - Abstract, motivation (Ruffy) ✔
+• Chapter 2 - RRL (Jessie, Miguel) ✔
+• Chapter 4 - Implementation
+• Chapter 5 - Conclusion
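
Note on the BLEU task listed in tasks.txt: a rough sketch of a sentence-level score is given below, assuming a single reference translation, whitespace tokenization, uniform weights over 1-grams to 4-grams, and no smoothing. The function names ngram_counts and sentence_bleu are illustrative and are not part of this repository. This is a simplified variant, so its numbers may differ from library implementations that apply smoothing or use multiple references.

```python
import math
from collections import Counter
from typing import List


def ngram_counts(tokens: List[str], n: int) -> Counter:
    """Count every contiguous n-gram in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def sentence_bleu(reference: List[str], candidate: List[str], max_n: int = 4) -> float:
    """Unsmoothed single-reference BLEU with uniform n-gram weights."""
    if not candidate:
        return 0.0
    log_precision_sum = 0.0
    for n in range(1, max_n + 1):
        cand = ngram_counts(candidate, n)
        ref = ngram_counts(reference, n)
        total = sum(cand.values())
        # Clip each candidate n-gram count by its count in the reference.
        clipped = sum(min(count, ref[gram]) for gram, count in cand.items())
        if total == 0 or clipped == 0:
            return 0.0  # without smoothing, one zero precision zeroes the score
        log_precision_sum += math.log(clipped / total)
    # Brevity penalty: penalize candidates shorter than the reference.
    c, r = len(candidate), len(reference)
    brevity_penalty = 1.0 if c > r else math.exp(1 - r / c)
    return brevity_penalty * math.exp(log_precision_sum / max_n)
```

A possible use is to call sentence_bleu(reference_line.split(), output_line.split()) for each line pair from the English test file and output.txt, then average the scores. Short sentences with fewer than four tokens score 0.0 under this unsmoothed variant, which is one reason a corpus-level or smoothed score may be preferable.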
