-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathcode.htm
More file actions
107 lines (98 loc) · 5.02 KB
/
code.htm
File metadata and controls
107 lines (98 loc) · 5.02 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<title>Xu SUN</title>
<style type="text/css">
.name {
font-size: 24px;
}
body {
background-color: #FFF;
}
body,td,th {
font-size: 18px;
}
.crf-adf {
font-size: 24px;
font-weight: bold;
}
.crf-adf2 {
font-size: 22px;
}
name1 {
font-weight: bold;
}
.size20 { font-size: 20px;
}
</style>
<script>
(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
})(window,document,'script','//www.google-analytics.com/analytics.js','ga');
ga('create', 'UA-73800643-1', 'auto');
ga('send', 'pageview');
</script>
</head>
<body>
<h2 class="crf-adf"><a href="index.htm">Xu SUN</a>'s code</h2>
<h2>Recent softwares are available at <a href = "https://github.com/lancopku">https://github.com/lancopku</a> </h2>
<h2>Others: </h2>
<blockquote>
<p class="crf-adf">Structure Regularization for Structured Prediction</p>
<p class="size20">Very simple code & some notes about the NIPS paper "structure regularization for structured prediction" --> click here [<a href="software/SR.txt">code & notes</a>]</p>
<p class="crf-adf"> </p>
<p class="crf-adf"><strong>CRF-ADF Sequential Tagging Toolkit v1.0
</strong></p>
<blockquote>
<p>This is a general purpose software for sequential tagging (or called sequential labelling, linear-chain structured classification). The CRF (Conditional Random Fields) model is described in (Lafferty et al., 2001) and the ADF (Adaptive stochastic gradient Decent based on Feature-frequency information) fast training algorithm is described in (Sun et al., ACL 2012).</p>
<p>Main features: </p>
<ul>
<li> Developed with C#</li>
<li> High accuracy (72.3% on Bio-Entity Recognition Task at BioNLP/NLPBA 2004, and 97.5% on Chinese Word Segmentation MSR Task)</li>
<li> Fast training (faster convergence rate than traditional batch/online training methods, including LBFGS & SGD)</li>
<li> General purpose (it is task-independent & trainable using your own tagged corpus)</li>
<li> Support rich edge features (Sun et al., ACL 2012)</li>
<li> Support various training methods, including ADF training, SGD training, & Limited-memory BFGS training</li>
<li> Support automatic n-fold cross-validation for tuning hyper-parameters</li>
<li> Support various evaluation metrics, including token-accuracy, string-accuracy, & F-score</li>
</ul>
<p class="crf-adf2">[<a href="software/CA.tu.pdf">Tutorial</a>] [<a href="software/CRFADF.zip">Download the source code</a>]</p>
<p> </p>
</blockquote>
<p class="crf-adf">Latent Structured Perceptron Toolkit v1.0</p>
<blockquote>
<p>This is a general purpose software for sequential tagging with the emphasis on fast training speed. This toolkit includes Latent Structured Perceptron (LSP) model (Sun et al., IJCAI 2009, TKDE 2013). It also includes traditional Structured Perceptron (SP) model and with the averaged version (Collins, 2002).</p>
<p>Main features:</p>
<ul>
<li>Developed with C#</li>
<li> Automatic modeling of hidden information (latent structures) in the data (Sun et al., IJCAI 2009, TKDE 2013)</li>
<li> Fast training (much faster than probabilistic models like CRFs)</li>
<li> General purpose (it is task-independent & trainable using your own tagged corpus)</li>
<li> Support rich edge features (Sun et al., ACL 2012)</li>
<li> Support various evaluation metrics, including token-accuracy, string-accuracy, & F-score<br />
</li>
</ul>
<p><span class="crf-adf2">[<a href="software/LSP.tu.pdf">Tutorial</a>] [<a href="software/LSP.zip">Download the source code</a>]</span></p>
<p> </p>
</blockquote>
<p class="crf-adf">Online Multi-Task Learning Toolkit (OMT) v1.0</p>
<blockquote>
<p>This is a general purpose software for online multi-task learning. The online multi-task learning is mainly based on Conditional Random Fields (CRF) model and Stochastic Gradient Descent (SGD) training. The work is described in (Sun et al., TKDE 2013).</p>
<p> Main features:<br />
</p>
<ul>
<li> Developed with C#</li>
<li>High accuracy on the human activity recognition tasks (Sun et al., TKDE 2013)</li>
<li> General purpose (it is task-independent & trainable using your own tagged corpus)</li>
<li> Support SGD training & Limited-memory BFGS training</li>
<li> Support various evaluation metrics, including token-accuracy, string-accuracy, & F-score </li>
</ul>
<p><span class="crf-adf2">[<a href="software/OMT.tu.pdf">Tutorial</a>] [<a href="software/OMT.zip">Download the source code</a>]</span><br />
</p>
</blockquote>
<p class="crf-adf"> </p>
</blockquote>
</body>
</html>