2003年12月31日

New Year

This the end day of 2003, tomorrow is a new year.

New year, will be new vision.

I'll try again!

2003年12月30日

First Volume of FSNLP

Last midnight, I finished the reading of the first volume of FSNLP.
This morning, I changed it for the second volume.

The first book gives me a summary of FSNLP. And I have found out that FSNLP is very good for the primary researcher to read for being familiar with NLP.

With the Adding Knowledge Program, I have persisted and gained a little.

I will keep on.

2003年12月29日

YOCSEF Meeting

This morning, I arrived at the administration building's YOCSEF assembly room.
After we had disposed the assembly room, the meeting's chairman Pro. Zhao Tiejun declare the begin of YOCSEF. This is my second time to attend YOCSEF.

The subject of this YOCSEF is Digital Olympics and muitillanguage information processing. And there were three specially invited guests. They described the blueprint of Olympics and the current most difficulties.

I have been familiar with the general picture of the Digital Olympics. And I found out that the Information Processing is more useful in the current times. And the prospect of our lab is very beautiful.

2003年12月28日

找到高中校友

今天晚上在实验室忽然接到一个电话,说是和我一个高中毕业的同学。一想才知道是那个大二的和我一个高中的校友在一位计算机系大三同学的帮助下找到了我。我让他马上过来。

我们聊了很多。包括他的学习状况,工作和生活,还有我的学习工作和生活。从他那里知道了在哈工大的其他几个高中校友的联系方式。以前就听说过有这么几个校友在,但是一直都没有找到。今天终于找到了。刚才给一个在化学系的大三的校友通了电话。原来他们都知道我,只是一直没有找到我而已。

想和他们约好时间一起聚聚,共述家乡情谊。

久旱逢甘露,他乡遇故知。 故乡情谊总是最真的。 感动……

2003年12月27日

《手机》观后感

回到寝室,和同学一起看了《手机》。

首先,佩服葛优的演技。

其次,影片的反映的主题很沉重。剧中台词:近,太近了,近的人都喘不过气来了。片尾严守一的侄女给他演示手机的全球精确定位和即时照相的功能的时候,严守一吓呆了。

科技是把双刃剑。想想小时候度过的那种信息不很发达的时代,对比现在的存在于任何空间的信息,人类确实进步了很多。但是,进步的同时是否有失去了很多……

2003年12月26日

Rough Set Theory

Our WSD research group will do some experiments on Rough Set. And Mr.Lzm believes that rough set can be used for WSD very effectively.

So I am arranged to read dome materials on RS.

Firstly, I read Knowledge Discover by TSinghua Express. But the content of this book is not enough. So I find some papers about RS.

I have understood that reading summarize article is the fast way to be familiar with the area. I read a summarize paper about RS. And the knowledge points are very clearly.

2003年12月25日

Studying paper of Li Juanzi

This morning I am reading Li Juanzi's paper 《语言模型中的一种改进的最大熵方法及其应用》which is published on JOURNAL OF SOFTWARE.

In her paper, she used an updated method combining maximum entropy, mutual information and Z-test to choose the best feature of context for a multivocal word and then used IIS algorithm to optimize the parameters of the linear model for Word Sense Disambiguation.

The experiment results are displaying the advantage of this approach. But I think the paper has two flaws. Firstly, the experiments for WSD is not enough. Secondly, Z-test is usually used to test normal distribution for large scale's samples. And in this paper there is a connotative hypothesis that the mutual information between the feature set and the category a multivocal word is followed normal distribution. The experiments did not prove this hypothesis.

And I think this experiments could be done more fully, and the experiment should do the hypothesis test.

OK. There is an idea. We can arrange some little experiments of our current information corpus to do some hypothesis test. En, this idea should be thought more and more.

2003年12月24日

收到李涓子老师的博士论文

感谢李涓子老师的热心帮助!

2003年12月23日

A bless mail from Korean

One of my last year's MCM teammate, who is studying for her graduate degree in Korean Puxiang University, mailed a Christmas card to me.

She told me a lot of news of her study and life. She has gained the usually first achievement of her class. And at this Spring Festival she won't come back. Inseadly she will go to Seoul University to meet her friends. She has a long term scheme to publish two or more international papers. To this scheme she has a lot of confidence. And I believe her can do so.

Also, I have told my recent state and my works in recent months to her. She told me to study more studious and do more practices and prepare to do more achievement.

I make a good bless to her study and life in Korean and may her journey to Seoul.

2003年12月22日

A exciting news!!

There is a exciting news that our lab has gained first in a recent evaluating.

I'm excited. Dr.Tliu tells us that we should have self-confidence to do everything better.

Great!


2003年12月21日

Matlab 程序升级

卢老师让我再做10-10,3-3,3-7,7-3等神经网络下的词义消歧实验。
想到上次5-5的实验花了我一整天的时间,而且还需要人工不断的切换,非常麻烦。
这次实验的总量是5-5的4倍,肯定不能采用原先的那种半自动的方法了。为此我花去了好几个小时来编制全自动实验的Matlab程序。

磨刀不误砍柴功。我在下午4:30编写完程序并将10-10的语料全部规范化后开始在服务器上运行程序,伪词01的21组实验花去三个小时左右就完成了。

这让我再次体会到了程序全自动化的好处。比起上次不断的人工切换程序,这次没有花费任何中途的人工干涉,效率大幅度提高亚。

2003年12月20日

Study for vc++ again!

Visual C++.NET is somewhat different from VC++ 6.0.

I think so.

2003年12月19日

Visual C++

In yesterday's adding knowledge program, I only read some base knowledge of Visual C++. But this evening, I did a small program on MFC after the guide of the book.

After I understood some codes of this program I was very happy.

MFC in .NET is more wonderful. Because it's desktop program like visual basic.

Great, study for program must do a lot of practices. I'll keep on.

2003年12月18日

My Adding Knowledge Program

Up to now I have joined in IRLab for nearly five months. And I gradually discovered my shortage of my knowledge system. I summarized it into three main aspects: English general ability, Visual C++'s programming ability and the foundation knowledge of Natural Language Processing.

I had been analyzing them for a whole week. After the relative systemic analysis I made the Adding Knowledge Program aiming at the three aspects.

After compute everyday studying time, I was somewhat astonished. Because it is five hours. And after my careful thought, I found it was feasible.

The detailed program was made sure yesterday evening. And I carried it out this evening. Now I find it is reasonable for me.

I like the program. I believe I can stick to it.

2003年12月17日

Deep research of our method of WSD

Our method has achieved to a good effect. But in our paper there are a lot of view points should be deep researched.

Dr.Tliu discussed detailed with Mr.Lzm and me. And we got a lot of constructive results which could lead us to do deep research of this point for WSD.

I should do some constructive experiments for our research.

Just so.

2003年12月16日

Dr. Rahmat Shoureshi's visit

This afternoon, about 4 o'clock, the American guest Dr. Rahmat Shoureshi visited our computer department. When he, with our dean, went into our lab, I made the demos for him.

I had being prepared for his visitation for three hours. But I had somewhat strain. When I made the demo of Dependency Parsing, the visitor maybe be confused. So Dean Xu let me change another one. I made the latter demo of Englishing Writing Assistant. I said a lot of prepared sentences to him. At last, he understood the main idea of the demos.

I am very happy to this chance for communicating with this Doctor.

2003年12月15日

Rough Set theory

Today I read through the paper of Dr.Chen Qingcai. In his paper rough set theory has been used very often.

It can be used in changeing pinyin into words, and reducting the baseline words for computing similitude degree of words.

A doctorial paper is very ample. I have found the difficult of it.
But at the same time I find out that grey system, like rough set, can be used widely into Chinese Information Processing. But the foundation of Chinese Information Processing and Grey system should be combined. In order to do so, I should learn more of them.

Try, and again.

2003年12月14日

陈清才的论文学习之一

内容很丰富,今天才看了35页,明日继续。

2003年12月13日

Get along with the experiment

At 8:30 I began to do the one hundred and sixty-eight experiments. I divided the experiments into four parts. I followed the way which I had used in the science and technology innovation contest that change the iterate times to observe the trend of results.

I implemented them one by one, and wrote the results on my notebook.

At 4:30 this afternoon, I had done them finally. At that time I was very happy.

Mr.Lzm observed the results, and asked me to study the paper of Dr.Chen Qingcai and try to use the rough set theory.

Rough set is very good theory. I think it is useful as Grey system theory. I will contrast them and use them in WSD.

2003年12月12日

Detailed experiment for NN-WSD

Based on the result of yesterday's full xperiments, after discussing the detaild experiment with Mr.Lzm, I begin to do the series of experiments on all kinds of corpor scales.

The diffilcult portion of the series experiments is to chage the original corpus into the format for matlab.

After I had done the difficult portion, I implement the programms and get the results quickly.

When I hand over today's results to Mr.Lzm, he gives me a lot of suggests for updating the experiment. So I will do careful experiments on five cmputers tomorrow.

It will be taken a whole day.

2003年12月11日

Full Experiment of WSD

In order to make sure the optimal inside net structure, I have done a self-contained experiment plan.

Likely to the experiment of my science and technology innovation, I confirm the parameters one by one.

At last, just now, I complete the whole experiment plan. I make the conclusion that: the magnify parameter is 9, the err_goal is 0.3, one connotative layer is enough and obligatory, the number of the connotative layer's node is 12.

I thought originally that two connotative layers is the best choice. But after my test experiment, I find out my former thought is wrong.

Experiment is the best tool to prove your idea.

2003年12月10日

O-O for Software Engineer

This evening, I begin to study O-O for Software Engineer.

At the class, I find out that the teacher's style is very different from Chinese teacher. Because the teacher, with five years oversea working career, is back from America. After the discuss of my friend and me, we agree with each other. We think that American teaching style is to make the problem simplier and simplier. The questions we are asked are very easy and can be answered quickly. Their teaching contains a lot of instances and is short of clearly logic.

At the end of the class, I begin to adapt to her teaching style.

Great! Continue studying for it and going to the classes.

2003年12月9日

续昨日

今晚按照今日计划,我继续阅读《自然语言理解-计算机能思维吗》。虽然今晚已经看完了全书厚度的一半,但我明显感觉我所学习到的东西没有全书的一半,大概只有十分之一。书中有些费解之处或是繁琐之处我略过了一些。

现在感觉这本书写的非常好,非常适合我们初步涉猎中文信息处理的学生好好研读。

今日从书中最大的收获便是:三段论正确的本质原因是包含关系的传递性。

还有一点是:语法包含传统语法、结构语法、短语结构文法、转换生成语法、格文法、CD概念从属理论。

此书需读百遍,其义方能初显。 感觉如此。

2003年12月8日

《自然语言理解-计算机能思维吗》

昨天开始学习王开铸老师在1995年写的《自然语言理解-计算机能思维吗》,书很薄,但很精辟。我感悟颇深。

摘抄一些如下:

经过漫长的社会演变,已经形成如今的八大语系:汉藏语系、印欧语系、亚非语系、阿尔泰语系、乌拉尔语系、尼日尔-刚果语系、马来-玻里尼西语系和德拉维达语系。

自然语言理解的三种观点:系统工程观点、层次结构观点、层次间单向观点。

对话双方的言语链过程:思维层-〉生理层-〉物理层-〉生理层-〉语言层-〉思维层

以上这些均是从第一章中摘录的。 仔细理解,确实很耐人寻味。

2003年12月7日

The adjusting period

This afternoon our lab has the weekly report meeting. It is turn to our Dic-Constructing group to report our works and difficult.

Zhu Liuliu take the report firstly. And I give two demos of our Cup-Dic. Indeed, there are a lot of shortages of our data files and demos. And I have the confidence to make it better, under the guide from Dr.Tliu and Mr.Lzm.

If you find the errors, you can solve them one by one. Yes, our group has found the shortages of our work. We will cut them one by one. So, I think we will do better after some time.

And after these days busy, I find some lack of my work style and arranging of my life. I analysis and get the conclusions as follows.

Firstly, my attention is easy to be thrown into confusion. If there is a emergent task which I must finish in few days, I easily do not carry out my former plan.

Secondly, my emotion is impacted easily by the current working state. Before a week, I should finish three big tasks parallel in few days. During my hard and hard working, I was not happy. But after the busy period I find I can do them better if I plan better.

Sharpen the saw. I must plan carefully enough, and do them one by one every day. Every morning, I should plan my detailed tasks of this day. At evening, I must chek them one by one, and adjust the leaves. Every week, I should do like this. So do every month.

Do them at once.


2003年12月6日

Construction of the Dictionary

This weekend, it is turn to Zhu Liuliu and me to report the latest development of the construction os the dictionary.

Zhu Liuliu and I were preparing the materials for the report. I have outlined the report into two mainparts. First one, for Zhu Liuliu, are bottom data files construction and the similitude degree of Chinese words, and the other one, for me, are the two demos of IdeaNet, Cup-Dicthe shortages and the next scheme.

I have done my part. And just Zhu Liuliu have done,too.


2003年12月4日

Exciting of the message from Dr.Tliu

These days our lab is devoting for the Big project of National center.

And Dr.Tliu gave us some very exiting news.

This afternoon, Dr.Tliu came into our room with the successing message.

All of us were excited.

Yes, it is exciting. We should try more!

2003年12月3日

心绪 2

前几天写下了《心绪》,那时心情不太好,是因为许多事情压过来,感觉无法承受。

今天再写《心绪》时,我感觉到的是需要充实的生活。 这种感觉伴随着软件工程的考试结束而更加强烈。我们的大学四年生涯已经所剩无几了。每天在寝室听到考研的倒计时,每减少一天我都会默默的祝福我的同寝室的同学们,同时也发现时间确实过的很快,特别是早上起床后,在晚上睡觉时感觉更是强烈。

时间如流水,匆匆逝去。 我们无法改变时间之水的流动速度,但是我们可以认认真真的度过每一分钟,这样才会充实得不至于整天都很忙,但碌碌无为。

好好珍惜这些大好时光,不要浪费掉每一秒。
这就是我对时间的心绪。

2003年12月2日

Exam of S.E.

Software Engineering is the last exam of the classes that should be numbered in the total achievement.

I have been reviewing for it for five days. I read through the book at least two times. And I have done one exam paper and the all homework to Mrs.Wang Yuying's powerpoint.

I think I have prepared fully for it.

But when I doing my exam paper, I find there is a lot of difference. The paper include some knowledge about the Compile Theory and the whole system of our school. I add a lot of my experience of my development of some software.

I think I have tried my best. It is enough.

2003年12月1日

The last class of Communist Party Colledge

This evening, there is the last class of Communist Party Colledge.

After the compere's audios of Divine Boat and Yang Liwei, some students go to the dais to talk about the self-feeling.

I am the third one who go to the dais. Firstly, I talk about the Diving boat success's significance. Then I state my viewpoint about the Diving boat's success. I anlysis the contrast between little success of Diving boat and the austere situation of our nation. I think the success of Diving boat is very little of the full project of our nation's renaissance, and we must foucs on the other things of the face of the project.

But the career is not by only day, it is a long term scheme. We must try our best to do what we must do and what we want to do.

After my address, all of our students agree with my view.

It is ok!