黄宜华 博士, 教授, 博导

Yihua Huang, Ph.D., Professor

南京大学 计算机科学与技术系

Department of Computer Science & Technology, Nanjing University, China

PASA大数据技术研究组

PASA Big Data Research Lab

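China Computer Federation (CCF) Big Data Expert Committee

Member, Deputy Secretary-General

Jiangsu Computer Society Big Data Expert Committee

Director

Jiangsu Computer Society Cloud Computing Technical Committee

Deputy Director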

 


    联系信息                                                                                         Contact Information

邮件: 黄宜华, 南京大学计算机科学与技术系, 汉口路22, 中国南京 210093

Mail: Yihua Huang, Department of Computer Science & Technology, Nanjing University, 22 Hankou Road, Nanjing 210093, China

办公室: 南京大学仙林校区 计算机系大楼408

Office: 408, Computer Department Building, Xianlin Campus of Nanjing University

电话: 025-8968-6517

Tel: 025-8968-6517

邮箱: yhuang@nju.edu.cn

Email: yhuang@nju.edu.cn

 

    主要研究兴趣                                         Research Interest

         大数据并行处理与云计算技术

Big data parallel processing and cloud computing

         Web信息挖掘与集成

Web data mining and integration

         体系结构与并行计算技术

Computer architecture and parallel computing

         Internet/Web技术与应用

Internet/Web technology and application

         中文信息处理与中文文本语义分析

Chinese information processing and Chinese text semantic analysis

 

 

 

    近期承担的研究项目                                    Recent Research Projects

1.  Apache Spark Tachyon优化与功能增强

1. Optimization and Enhancement for Apache Spark and Tachyon

UC Berkeley AMP实验室开源合作研究

UC Berkeley AMP Lab Joint Open Source Research

2014-2015

2014-2015

2. 面向大数据的媒体内容分析与关联语义挖掘研究

2. Research on Big Media Data Content Analysis and Associated Semantic Mining

国家自然科学基金专项基金项目(项目号61223003)

    National Natural Science Foundation of China Special Research Grant (#61223003)

        资助额:300万,2013.1-2016.12,项目主要参与者

Funding Amount: RMB 3 Million Yuan, 1/2013-12/2016, Co-PI

3. Gradient Boosting决策树(GBDT)Spark并行化训练算法研究

3. Gradient Boosting Decision Tree(GBDT) Parallel Training Algorithm with Spark

   百度主题研究项目

    Baidu Research Project

   资助额:10万,2014,项目负责人

    Funding Amount: RMB 100,000, 2014, PI

4. HBase二级索引与查询技术研究

4. Secondary Index and Query for HBase

    中兴通讯,项目负责人

    ZTE, China

    资助额:35万,2013-2014

    Funding Amount: RMB 350,000 Yuan, 2013-2014, PI

    5. 大规模中文文本语义分析与医疗文本挖掘

5. Large Scale Chinese Text Semantic Analysis and Medical Record Mining  

        美国Intel Labs大学研究资助项目

    USA Intel Labs URO Funding

        资助额:US$ 6万,2013.4-2014.3,项目负责人

Funding Amount: US$ 60,000, 4/2013-3/2014, PI

6. 面向复杂结构的精确Web信息抽取集成模型与关键技术研究

6. Research on Model and Techniques for Web Info Extraction & Integration

        国家自然科学基金面上项目(项目号61072152)

National Natural Science Foundation of China Research Grant (#61072152)

资助额:32万,2011.1-2013.12,项目负责人

Funding Amount: RMB 300,000 Yuan, 1/2011-12/2013, PI

    7. 精确信息定制服务Web信息抽取集成通用引擎与服务软件平台

7. Accurate Web Info Extraction and Integration Engine and Service Platform

        江苏省科技支撑计划项目(项目号BE2011172)

    Jiangsu Province Science & Technology Research Grant (#BE2011172)

        资助额:60万,2011.4-2013.12,项目负责人

    Funding Amount: RMB 600,000 Yuan, 4/2011-12/2013, PI

 

 

 

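    Education and Work Experience

     2008-present  Professor, Department of Computer Science & Technology, Nanjing University

     2002-2008  Researcher, Center for Biotechnology and Genomic Medicine, Medical College of Georgia, USA

     1998-2001  Visiting Scholar, Database Research Center, University of Florida, USA

     1998-2001  Professor, Department of Computer Science & Technology, Nanjing University

     1993-1997  Associate Professor, Department of Computer Science & Technology, Nanjing University

     1988-1993  Lecturer, Department of Computer Science & Technology, Nanjing University

     1986-1988  Teaching Assistant, Department of Computer Science & Technology, Nanjing University

     1994-1997  Ph.D. student, Department of Computer Science & Technology, Nanjing University

     1983-1986  Graduate student, Department of Computer Science & Technology, Nanjing University

     1979-1983  Undergraduate student, Department of Computer Science & Technology, Nanjing University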

 

 

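    Teaching

Current course:

Large-Scale Massive Data Parallel Processing (undergraduate and graduate)
(courseware available for download from the Google University Relations website)

Courses previously taught:

Web Technology and Application Development

Principles of Computers

Microcomputer Principles and Interfacing

Programming Languages

Chinese Information Processing

Digital Circuit Design

Curriculum development:

Development of the computer hardware course group and research on laboratory teaching

Class advisor lectures:

Lecture 1: How to Adapt Quickly to University Study and Life (PDF)

Lecture 2: To Build a Career, First Build Character: Cultivating Personal Integrity and All-Round Qualities in the University Years (PDF)

Lecture 3: The Computer Science Discipline: Majors, Courses, and Body of Knowledge (PDF)

Graduate training:

Requirements and Guidelines for Graduate Study and Training (for internal use within the research group) (PDF)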

 

                                           

 

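    Conference Talks

2012  Hadoop and Big Data Technology Conference talk: Technical Levels and Main Research Topics of Big Data Research (PDF)

2013  Big Data Technology Research and Teaching (PDF)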

 

                                           

 

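    Awards

2012: Google Teaching Award

2012: Graduate students from the course entered the first China Cloud/Mobile Internet Innovation Grand Prize Competition as teams, winning 9 winner's prizes and 4 outstanding team leader awards, with RMB 200,000 in prize money

2000: Second Prize, Jiangsu Province Science and Technology Progress Award

1993: Second Prize, Jiangsu Province Science and Technology Progress Award

1997: Winner's Prize, 3rd China PC Application Software Design Competition

1997/1996/1995: Outstanding Young Teacher of Nanjing University

1995: Jiangsu Province Advanced Science and Technology Worker of the Eighth Five-Year Plan Period

1995: Second Prize for Textbooks, State Education Commission; First Prize for Outstanding Textbooks, Nanjing University

1992: First Prize, Jiangsu Province Outstanding Software Award

1991: Nanjing University Special Contribution Award for Science and Technology Development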

 

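    Hobbies

   Table tennis, reading, philosophy, traditional Chinese culture, traditional Chinese medicine and health care

Essay: The Little Bird That Flew Far Away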

 

 

 

    书籍与发表论文                                          Publications

 

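Book: Understanding Big Data: Big Data Processing and Programming Practice (in Chinese), China Machine Press, 2014. Part of the textbook series on system capability training for computer majors, Computer Teaching Steering Committee of the State Education Commission.

Research papers: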

1.      Rong Gu, Shanyong Wang, Fangfang Wang, Chunfeng Yuan, Yihua Huang. Cichlid: Efficient Large Scale RDFS/OWL Reasoning with Spark. Accepted by 2015 IEEE International Parallel & Distributed Processing Symposium (IPDPS 2015), India, May 25-29, 2015.

2.      Rong Gu, Xiaoliang Yang, Jinshuang Yan, Yuanhao Sun, Bing Wang, Chunfeng Yuan, and Yihua Huang. SHadoop: Improving MapReduce Performance By Optimizing Job Execution Mechanism in Hadoop Clusters. Journal of Parallel and Distributed Computing. Vol.74(3), 2014, pp. 2166-2179.

3.      Rong Gu, Wei Hu, Yihua Huang. Rainbow: A Distributed and Hierarchical RDF Triple Store with Dynamic Scalability. Proc. of the 2014 IEEE International Conference on Big Data (IEEE BigData 2014), p 561-566, Oct. 27-30, Washington, USA.

4.      Shengsheng Shi, Chengfei Liu, Chunfeng Yuan, Yihua Huang. Multi-Feature and DAG-Based Multi-Tree Matching Algorithm for Automatic Web Data Mining. The 2014 Web Intelligence Congress(WI 2014), Aug. 11-14, Warsaw, Poland.

5.      Hongjian Qiu, Rong Gu, Chunfeng Yuan and Yihua Huang. YAFIM: A Parallel Frequent Itemset Mining Algorithm with Spark. The 3rd International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics, in conjunction with IPDPS 2014, May 23, 2014, Phoenix, USA.

6.      Lei Jin, Rong Gu, Chunfeng Yuan and Yihua Huang. Large Scale Deep Learning On Xeon Phi Many-core Coprocessor. The 3rd International Workshop on Parallel and Distributed Computing for Large Scale Machine Learning and Big Data Analytics, in conjunction with IPDPS 2014, May 23, 2014, Phoenix, USA.

7.      Ge, Wei; Huang, Yihua; Zhao, Di; Luo, Shengmei; Yuan, Chunfeng; Zhou, Wenhui; Tang, Yun; Zhou, Juan. CinHBa: A secondary index with hotscore caching policy on key-value data store. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), v 8933, p 602-615, 2014.

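8.      Rong Gu, Fangfang Wang, Chunfeng Yuan, Yihua Huang. YARM: An Efficient and Scalable Semantic Reasoning Engine Based on MapReduce (in Chinese). Chinese Journal of Computers, No. 1, pp. 74-85, 2014/8.

9.      Rong Gu, Jinshuang Yan, Xiaoliang Yang, Chunfeng Yuan, Yihua Huang. Performance Optimization for Short Job Execution in Hadoop MapReduce (in Chinese). Journal of Computer Research and Development, 2014, Vol. 51 (6): 1270-1280.

10.   Bo Zhao, Shujian Huang, Xinyu Dai, Chunfeng Yuan, Yihua Huang. A Parallelized Hierarchical Phrase-Based Machine Translation Algorithm Based on a Distributed In-Memory Database (in Chinese). Journal of Computer Research and Development, 2014, Vol. 51 (12): 2724-2732.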

11.   Rong Gu, Furao Shen, and Yihua Huang. A Parallel Computing Platform for Training Large Scale Neural Networks. Proceedings of the IEEE International Conference on Big Data (IEEE BigData 2013), pp. 376 - 384, Santa Clara, CA, USA, Oct. 6-9, 2013.

12.   Shengsheng Shi, Wu Wei, Yulong Liu, Haitao Wang, Lei Luo, Chunfeng Yuan, and Yihua Huang. NEXIR: A Novel Web Extraction Rule Language toward a Three-Stage Web Data Extraction Model. The 14th International Conference on Web Information System Engineering (WISE2013), Nanjing, China, 13-15 Oct. 2013. WISE 2013, Part I, “Lecture Notes in Computer Science” Proceedings 8180, p29-42, Springer-Verlag Berlin Heidelberg, 2013.

13.   Wu Wei, Shengsheng Shi, Yulong Liu, Haitao Wang, Chunfeng Yuan and Yihua Huang. Extraction Rule Language for Web Information Extraction and Integration. The 10th Web Information System and Application Conference, WISA2013, p65-70, Nov. 1-3, Yangzhou, China, 2013.

14.   Shengsheng Shi, Fuliang Quan, Tao Xie, Chunfeng Yuan and Yihua Huang. Layered and Weighted Tree Matching Algorithm for Automatic Web Data Records Recognition. The 10th Web Information System and Application Conference, WISA 2013, p55-60, Nov. 1-3, Yangzhou, China, 2013.

15.   Yi Shen, Shengsheng Shi, Haitao Wang, Wu Wei, Chunfeng Yuan, and Yihua Huang. Parallel Approach and Platform for Large-scale Web Data Extraction. 2013 The First International Conference on Advanced Cloud and Big Data(CBD 2013), Nanjing, Dec. 13-15, 2013.

16.   Wenhui Zhou, Chunfeng Yuan, Rong Gu, Yihua Huang. Large Scale Nearest Neighbors Search Based on Neighborhood Graph. 2013 The First International Conference on Advanced Cloud and Big Data(CBD 2013), Nanjing, Dec. 13-15, 2013.

17.   Jinshuang Yan, Xiaoliang Yang, Rong Gu, Chunfeng Yuan, and Yihua Huang. Performance Optimization for Short MapReduce Job Execution in Hadoop. Proceedings of 2nd International Conference on Cloud and Green Computing and 2nd International Conference on Social Computing and Its Applications, CGC/SCA 2012, p 688-694, 2012

18.   Tao Xie, Shengsheng Shi, Fuliang Quan, Chunfeng Yuan, and Yihua Huang. Research on Complex Structure-Oriented  Accurate Web Information Extraction Rules. Proceedings of the 2010 IEEE International Conference on Progress in Informatics and Computing, PIC 2010, p 312-316, 2010

19.   Xiaoliang Yang, Chunfeng Yuan, Yihua Huang. Parallelization of BLAST with MapReduce for long sequence alignment. Proceedings - 2011 4th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2011, p 241-246, 2011

20.   Tao Xiao, Shuai Wang, Chunfeng Yuan, Yihua Huang. PSON: A Parallelized SON Algorithm with MapReduce for Mining Frequent Sets. The Fourth International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2011, p 252-257, 2011

21.   Yongzhuang Wei, Shuai Wang, Chunfeng Yuan, and Yihua Huang. Parallelized Near-Duplicate Document Detection Algorithm for Large Scale Chinese Web Pages. Proceedings of the 13th International Conference on Parallel and Distributed Computing, Applications and Technologies, PDCAT 2012, p 523-529, 2012.

22.   Jian Zhang, Chunfeng Yuan, and Yihua Huang. Parallelized Similarity Flooding Algorithm for Processing Large Scale Graph Datasets with MapReduce. Proceedings of the 13th International Conference on Parallel and Distributed Computing, Applications and Technologies, PDCAT 2012, p 184-188, 2012.

23.   Yulong Liu, Shengsheng Shi, Chunfeng Yuan and Yihua Huang. Automated Text Data Extraction based on Unsupervised Small Sample Learning. The 7th Intelligent System and Knowledge Engineering (ISKE 2012), Dec. 15-17, 2012, Beijing. Chapter in book “Foundations and Applications of Intelligent Systems”, Advances in Intelligent Systems and Computing 213, p133-150, Springer-Verlag Berlin Heidelberg, 2013.

24.   Chunfeng Yuan, Yihua Huang, Zhesheng Zhang, Guihai Chen, Wanchun Dou. Improvements on Teaching Methods and Contents for the “Computer Organization and Architecture” Curriculum. Proceedings of International Conference on Scalable Computing and Communications - The 8th International Conference on Embedded Computing, ScalCom-EmbeddedCom 2009, p 560-565, 2009

25.   Jianxin Yu, Yihua Huang, The Curriculum in Embedded System for Undergraduates of Major on Computer. Proceedings of 2009 4th International Conference on Computer Science and Education, ICCSE 2009, p 1683-1688, 2009

26.   Jin Yu, Jianxin Yu, Yihua Huang. Design and Implementation of Embedded Networked Intelligent Chinese Checkers Game Software. International Conference on Automatic Control and Artificial Intelligence (ACAI2012), March 24-26,2012, Xiamen, China

27.   Yihua Huang, Tianyun Ni, Lei Zhou and Stanley Su. JXP4BIGI: a generalized, Java XML-based approach for biological information gathering and integration. Bioinformatics. Vol. 19 no. 18. 2003.

28.   Stanley Su, Chunbo Huang, Joachim Hammer, Yihua Huang, Haifei Li, Liu Wang, Youzhong Liu, Charnyote P., Minsoo Lee, Herman Lam. An Internet-Based Negotiation Server for E-Commerce. The VLDB (Very Large Data Bases) Journal, Special Issue on E-Services. Vol. 10, 2001.

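29.   Yihua Huang, Xiaobai You, Yuan Ji, et al. Data Structure Design for a Cooperative Authoring System for Hypermedia Document Bases (in Chinese). Journal of Software, Vol. 8, No. 3, 1997.

30.   Yihua Huang, Xiaobai You, Yuan Ji, et al. CCHMDBS: A Distributed Cooperative Authoring System for Chinese Hypermedia Document Bases (in Chinese). Journal of Computer Research and Development, Vol. 33, No. 3, 1997.

31.   Wenqing Yang, Yihua Huang, Jian Feng, et al. A Cooperative Authoring Mechanism for WWW Documents Based on a Document Directory Tree (in Chinese). Journal of Computer Research and Development, Vol. 36, No. 9, 1999.

32.   Yihua Huang, Xiaobai You, Yuan Ji, et al. Distributed Cooperation Mechanisms in an Authoring System for Large Document Bases (in Chinese). Journal of Software, 1997, Supplement.

33.   Yihua Huang, Yuan Ji, Xiaojiang Yang, et al. Design of a Distributed Cooperative Authoring System for Chinese Hypermedia Document Bases (in Chinese). Journal of Software, Special Issue on the 863 Program, 1996.

34.   Jian Feng, Ying Sun, Jian Lu, Zhihui Wang, Yihua Huang, Fuyan Zhang. NetCa: A Worldwide Cooperative Authoring System for the WWW (in Chinese). Journal of Software, 1999, Supplement.

35.   Yihua Huang, Xiaobai You, Yuan Ji, et al. Research and Implementation of Automatic Conversion from Conventional Text to Hypertext (in Chinese). Journal of Computer Research and Development, Vol. 34 (Supplement), 1997.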

36.   Huang Yihua, Zhang Fuyan, Ji Yuan, et al. The Distribution, Cooperation and Hyperlinking in an Authoring System for Chinese Hypermedia Document Base. Chinese Journal of Advanced Software Research, Vol. 4, No. 3, 1997.

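37.   Yuhua Sun, Jian Lu, Sai Sun, Yihua Huang, Fuyan Zhang. Design and Implementation of the Client-Side Document Directory Tree in a Cooperative Authoring System for WWW Documents (in Chinese). Mini-Micro Systems, 2000, Vol. 21, No. 3.

38.   Ying Sun, Yuhua Sun, Jian Feng, Yihua Huang, Fuyan Zhang. Design and Implementation of the Authoring Server in a Cooperative Authoring System for WWW Documents (in Chinese). Mini-Micro Systems, 2000, Vol. 21, No. 1.

39.   Sai Sun, Wenqing Yang, Yuhua Sun, Yihua Huang, Fuyan Zhang. Design and Implementation of a Graphical Browsing System for Web Documents (in Chinese). Mini-Micro Systems, 1999 (11).

40.   Jicheng Wang, Ying Sun, Yihua Huang, Fuyan Zhang. Design and Implementation of Communication Middleware in a Cooperative Authoring System for WWW Documents (in Chinese). Mini-Micro Systems, 1999 (04).

41.   Yihua Huang, Xiaobai You, Yuan Ji, et al. A Cooperative Hypermedia Document Base System Based on an Interactive Graphical Interface (in Chinese). Journal of Computer-Aided Design & Computer Graphics, Vol. 8 (Supplement), 1996.

42.   Jian Lu, Yuhua Sun, Jian Feng, Yihua Huang, Fuyan Zhang. Implementation of Visual Editing and Browsing of HTML Documents in the WWWDOC System (in Chinese). Journal of Computer-Aided Design & Computer Graphics, Vol. 6, 1999.

43.   Yihua Huang, Chunfeng Yuan. A Study of Several Problems in Scaling and Restoring Curve-Outline Chinese Character Glyphs (in Chinese). Journal of Chinese Information Processing, Vol. 9, No. 2, 1995.

44.   Yihua Huang, Xulong Wang, Chunfeng Yuan. Design and Implementation of a Vector Contour Compression Algorithm for Chinese Character Glyphs (in Chinese). Journal of Chinese Information Processing, Vol. 4, No. 4, 1992.

45.   Wenqing Yang, Yihua Huang, Fuyan Zhang. Research and Implementation of Full-Text Retrieval for Chinese Web Document Bases (in Chinese). Journal of Chinese Information Processing, Vol. 13, No. 4, 1999.

46.   Xulong Wang, Yihua Huang. Contour Vector Compression and Restoration of Dot-Matrix Chinese Character Fonts (in Chinese). Chinese Information Processing, Vol. 3, 1991.

47.   Yihua Huang, Xiaobai You, Yuan Ji, et al. Research and Implementation of CSCW-Based Chinese Hypermedia Technology (in Chinese). Journal of Nanjing University, Vol. 32, No. 3, 1996.

48.   Zhihui Wang, Yihua Huang, Yu Wang, et al. Research and Implementation of a New Structure-Based Technique for Compressing and Restoring Dot-Matrix Chinese Characters (in Chinese). Journal of Nanjing University, Vol. 35, No. 4, 1999.

49.   Zhihui Wang, Yihua Huang, Fuyan Zhang. Design of a Hypermedia Cooperative Authoring System in the WWW Environment (in Chinese). Computer Science, 2000 (12).

50.   Chunfeng Yuan, Yihua Huang. File Formats for Audio Information on Multimedia PCs (in Chinese). Microcomputer, Vol. 15, No. 6, 1995.

51.   Yihua Huang, Fuyan Zhang. Optical Disc and Network Publishing Technologies and Their Development Trends (in Chinese). Multimedia World, 1996 (12).

52.   Wenqing Yang, Yihua Huang, Yuan Ji, Fuyan Zhang. Application and Implementation of Full-Text Retrieval for Chinese Electronic Publications (in Chinese). Multimedia World, 1996 (12).

53.   Yuan Ji, Yihua Huang, Wenqing Yang, Fuyan Zhang. Development and Implementation of CCHMDOC, an Authoring Tool for Optical Disc Document Bases (in Chinese). Multimedia World, 1996 (12).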

54.   Yihua Huang, Wenqing Yang, Yuan Ji, et al. Design for Cooperative Authoring System for WWW Document. Proceedings of Second International Workshop on CSCW in Design, Nov. 26-28, 1997.

55.   J.Hammer, C.B. Huang, Y.H. Huang, C. Plue., M.Lee, H. Li, L. Wang, and S.Y.W.Su. The IDEAL Approach to Internet-based Negotiation for E-Business. The Proceedings of 16th International Conference on Data Engineering (ICDE’2000). Feb. 29-Mar. 3, 2000, San Diego, CA, USA.

56.   Yihua Huang, Fuyan Zhang, Yuan Ji, et al. The Distribution Cooperation and Hyperlinking Mechanism for a Hypermedia Document Base System Based on CSCW. Proceedings of International Workshop on CSCW in Design, May 8-11, 1996, Beijing, China.

57.   Yihua Huang, Xulong Wang, Chunfeng Yuan. An Algorithm for Vectorized Contour Compression and Generating for High Quality of Chinese Character Fonts. Proceedings of 1992 International Conference on Chinese Information Processing, 1992, Beijing, China.

58.   Yihua Huang, Xiaobai You, Ji Yuan, et al. A Cooperative Authoring System for Chinese Hypermedia Document. International Symposium on Information Science and Technology, 1996, Beijing, China.