@@ -9,10 +9,10 @@ We strictly follow the optimizer configuration as in HuggingFace and use batch s
 | -------| ------- | :---------: | :------: | :--: |
 | HuggingFace Default (bert-base-cased)| Test Set | 90.71 | 92.04| 91.37|
 | HuggingFace Default (roberta-base)* | Test Set | 89.41 | 91.47| 90.43|
-| BERT-base-cased (this repo)| Test set | 91.69 | 92.05 | 91.87 |
-| BERT-large-cased (this repo)| Test Set | 92.03 | 92.17 | 92.10 |
-| Roberta-base (this repo)| Test Set | 91.88 | 93.01 | 92.44|
-| Roberta-large (this repo)| Test Set | ** 92.27** | ** 93.18** | ** 92.72** |
+| BERT-base-cased-CRF (this repo)| Test Set | 91.69 | 92.05 | 91.87 |
+| BERT-large-cased-CRF (this repo)| Test Set | 92.03 | 92.17 | 92.10 |
+| Roberta-base-CRF (this repo)| Test Set | 91.88 | 93.01 | 92.44|
+| Roberta-large-CRF (this repo)| Test Set | **92.27** | **93.18** | **92.72** |
 HuggingFace Default (roberta-base)* has a tokenization issue: no leading space is added before each word, which changes RoBERTa's byte-level BPE output.
 
 We did not reach the 92.4 F1 reported in the BERT paper.
@@ -25,8 +25,8 @@ I think one of the main reasons is they are using the document-level dataset ins
 
 | Model| Dataset | Precision | Recall | F1 |
 | -------| ------- | :---------: | :------: | :--: |
-| BERT-base-cased (this repo)| Test Set | 89.57 | 89.45 | 89.51 |
-| BERT-large-cased (this repo)* | Test Set | - | -| -|
-| Roberta-base (this repo)| Test Set | ** 90.12** | ** 91.25** | ** 90.68** |
+| BERT-base-cased-CRF (this repo)| Test Set | 89.57 | 89.45 | 89.51 |
+| BERT-large-cased-CRF (this repo)* | Test Set | - | - | - |
+| Roberta-base-CRF (this repo)| Test Set | **90.12** | **91.25** | **90.68** |
 
 BERT-large-cased-CRF (this repo)* is still training; the remaining runs are not finished yet.
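
The roberta-base tokenization issue noted above can be illustrated with a toy sketch. The vocabulary and token IDs below are invented for illustration (they are not the real roberta-base vocabulary); the only real detail is that byte-level BPE tokenizers like RoBERTa's encode a leading space into the token itself, marked with the `Ġ` character:

```python
# Toy sketch (invented vocab/IDs, NOT the real roberta-base vocabulary).
# Byte-level BPE folds a leading space into the token, so "Germany" at the
# start of a string and " Germany" mid-sentence are different tokens.
vocab = {"Germany": 101, "\u0120Germany": 202}  # "\u0120" is the Ġ space marker

def to_token(text: str) -> str:
    # Byte-level BPE represents a leading space with the Ġ marker.
    return text.replace(" ", "\u0120")

# Feeding pre-split words without restoring the leading space (the bug noted
# above) makes every word look sentence-initial and shifts its token ID.
assert vocab[to_token("Germany")] != vocab[to_token(" Germany")]
print(to_token(" Germany"))  # ĠGermany
```

This is why a token-classification pipeline that splits text into words first must put the space back (e.g. via a prefix-space option) before tokenizing each word.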