•10+ years of DBMS kernel development experiences.
•2 years of data mining experience in the telecommunication industry.
•5 years of project management experience, leading a group of 18 people.
•10+ years of software development experience in C++/C, Java, Python, Perl.
Specialties:
•Deep database kernel knowledge, expertise in distributed query processing, query optimization, execution, storage, transaction management and concurrency control.
•Experience in distributed computing.
•Familiar with the open source database code, including Postgresql, BerkelyDB, MySQL.
•Map/Reduce in database
•Familiar with Hadoop
•Graph Engine
Senior Software Engineer @ Asterdata is a start up company that was acquired by Teradata in April, 2011. It is the first database in market that can do Map/Reduce in database.
Responsibilities and achievements:
•Database kernel development
- Dynamic partition pruning using stable function, loop join and scalar function
- Table function improvement (pipeline execution)
- Multi-insert statement for multi-output SQL-MR function
- Columnar table update and delete
•SQL Hadoop integration
- SQL planning for SQL-H (SQL-Hadoop) integration
•SQL-MR
- Semantic analysis for multiple input SQL-MR function
•Graph Engine
- Spool file infrastructure for temporary result
- Serialization infrastructure
- Graph Store From July 2011 to Present (4 years 4 months) Software Architect @ Teradata is the leader of the Data warehouse area.
Responsibilities and achievements:
•Contributed to the architect design
•Developed a MPP-aware SQL optimizer based on SQL standard 2003 from scratch as main designer and developer.
•Reviewed the design and implementation of query execution and storage engine.
•Worked closely with offshore Beijing team.
•Worked with program manager and other architects to define the roadmap of the product From July 2010 to June 2011 (1 year) Software Engineering Manager @ Responsibilities and achievements:
•Led a group of 18 people in Beijing.
•Worked closely with remote architects in US.
•Communicated with architects on design reviews.
•Established an automated testing framework to ensure code quality.
•Produced project plan for the Beijing development team, and monitored the progress.
•Developed database kernel modules for new product for Teradata, including SQL parsing, resolution, plan generation, expression evaluation, join enumeration etc. From July 2006 to June 2010 (4 years) Data Mining Consultant @ BASS Project at CMCC (China Mobile Communication Compnay), China
CMCC is the largest telecommunication company in China and also one of the largest telecommunication companies in the world. BASS is the infrastructure system within CMCC for its decision support.
Responsibilities and achievements:
•ETL (Data extracting, transformation and loading).
•Teradata enterprise data warehouse database administration.
•Produced 2 data mining solutions for customer, customer churn analysis and customer segmentation analysis. From July 2004 to June 2006 (2 years) Senior Software Engineer @ Kingbase is a chinese home-made database.
Responsibilities and achievements:
•Designed and implemented transaction concurrency control subsystem. As a result, the product was the first place winner in the Chinese DBMS performance evaluating test.
•Designed and implemented the buffer management subsystem for KingbaseLite, an embedded version of the Kingbase database product.
•Ported PostgreSQL 7.3 from Unix OS to Windows OS.
•Designed and implemented the Unix communication mechanism on Windows. From August 2001 to April 2003 (1 year 9 months)
Doctor of Philosophy (Ph.D.), Computer Science @ Renmin University of China From 2001 to 2004 Yingjie He is skilled in: Software Project..., Query Optimization, Transaction Management, Concurrency Control, Distributed Computing, Data Mining, C/C++, Java, PostgreSQL, Hadoop, MySQL, Teradata, Distributed Systems, Unix Shell Scripting, Data Warehousing, Perl, Python, Databases, ETL, SQL, Integration, Software Engineering, Unix, Database Administration, Database Design, Linux, Oracle, Enterprise Architecture, C++, Software Development, Agile Methodologies, C, Big Data, Cloud Computing, Open Source
Websites:
http://www.blogger.com/home?pli=1