Engineering Leader (Hiring Big Data Platform Engineers!)
United States
Engineering Manager II @ Uber Managing the Data Foundation - Streaming & Realtime Analytics Platform From October 2018 to December 2019 (1 year 3 months) San Francisco / Palo AltoSr. Software Engineer II @ Uber Data Analytic / Infrastructure / Storage engineer:• Developing and supporting a GPU-based analytics engine -- https://github.com/uber/aresdb• Streaming data in the cloud --...
Engineering Manager II @ Uber Managing the Data Foundation - Streaming & Realtime Analytics Platform From October 2018 to December 2019 (1 year 3 months) San Francisco / Palo AltoSr. Software Engineer II @ Uber Data Analytic / Infrastructure / Storage engineer:• Developing and supporting a GPU-based analytics engine -- https://github.com/uber/aresdb• Streaming data in the cloud -- Kafka ecosystem on AWS.• Data Foundation - Streaming and Realtime Analytics Platform• Data Foundation - Hadoop Ecosystem Engineering• Storage as a Service -- Cassandra on Mesos, MySQL, Postgres.• Build the distributed Marketplace Storage System for Uber's dispatch and fulfillment (internally called MSG) From January 2016 to October 2018 (2 years 10 months) San Francisco / Palo Alto Bay AreaSoftware Engineering Manager @ Twitter Managed the Hardware & System Reliability Engineer team; a group of talented engineers who are responsible for all production reliability aspects of hardware, BIOS, firmware, and Linux Kernel/OS to a more than 200,000 of Twitter’s servers. Our main customer is the Site Reliability Engineer (SRE) team. We support them across many different technologies including Mesos & Aurora, Hadoop, Vertica, and DB ecosystems to handle all system or performance related issues that impacted Twitter infrastructure and applications. We provided Framework and manage end-to-end system Bios, firmwares, kernel upgrades to very large number of servers:• Handling and overseen end-to-end hardware quality issues and managing low Total Cost of Ownership (TCO) for each hardware platform. Involved early in the manufactured product process -- As early as Design Verification (DVT) transaction into Product Verification (PVT). Successfully supporting and empowering the team to establish sets of production operational requirements, and raising the bar on hardware quality.• Built automation frameworks to monitor, and data mining server health and quality data throughout the fleets. Our team provides weekly metrics for hardware quality data such as HW Annual Failure Rate (AFR), hardware component (SSD, hard disks, memory, etc) issues and the failure statistics per each hardware platform / vendor / etc. Real-time TCO metrics and charge-back were also an important goal in our team roadmap• Build the hardware quality & sustaining processes• Implemented the better quality control and incident avoidance. From October 2013 to January 2016 (2 years 4 months) San Francisco Bay AreaStaff Software Engineer @ Twitter Build the Hardware-Quality team to support a large infrastructure focusing on Hardware & System Reliability From October 2013 to January 2016 (2 years 4 months) San Francisco Bay AreaSr. Software engineer @ Twitter Infrastructure engineer. Built automation frameworks to monitor, and data mining server health and quality data throughout the fleets.Hosting everyone to tweet... From September 2012 to October 2013 (1 year 2 months) San Francisco Bay AreaSystem Engineering manager @ Zynga Managing and supporting system engineer team (including across geography members) to build tools and support the high availability & serviceability of Zclound (Zynga private cloud)Architected and led a small system engineer team to build Zynga bare-metal and virtualized server farms – successfully build two new Zynga’s datacenter in California with about 30,000 servers online and in service for Zynga’s games. Bellow are couple highlights: • System performances, power-consumption benchmarks, and capacity planning to build our new Zynga gaming infrastructures - Zcloud• Worked with OEMs, and ODMs on new system architecture such as new intel CPU architectures• Build tools to automatic provision, maintain, and support the high availability of Zynga’s infrastructures and services. Successfully provisioning and supporting more than 40,000 production servers in all datacenter locations with different hardware from multiple vendors.• Build a system burn_in framework to automate component performance qualification • Build out-of-band with IPMI commands working with Ganglia, and Nagios to monitoring and support system administrator operational. Standardized what I called MORE (‘Minimum Operational Requirements or Equivalents’) for BMC/IPMI from all vendors.• Debugged, analyzed and root-caused Linux kernel crashes, and patched in kernel / driver bug fixes• Created "Splunk" dashboards to monitoring system / kernel issues, and proactively plan out solutions to such distributive known kernel issues. From May 2011 to September 2012 (1 year 5 months) San Francisco Bay AreaStaff Software Engineer @ VMware, Inc Developed new features requests, handled customer-escalations (active member of tiger team) for storage and network related issues in ESX server products (2.5.x, to 5.0.x) from a large customer / partner base, included IBM, HP, Fujitsu, and DELL. Specialized in iSCSI, NFS, SAS, SATA, tape, USB, and multi-path storage drivers in ESX kernel virtualization environment. A few high-lines are listed below:• Designed and enhanced the SCSI multi-path I/O modules (internal named PSA -- Plug-in Storage Architecture) of the virtual machine kernel (vmkernel) to protect against path failures. Major performance improved ESX boot time by introduced a target/LUN parallel discovery mechanism. This has successfully resolved a lot of major customer issues on older produces.• Implemented SCSI LUN reset feature in software initiator iSCSI driver for vmkernel multi-path codes to cleanly doing path-failover without corrupting concurrently IO on other iSCSI LUNs behind the same SCSI-target. This feature was successfully resolved a lot of customer’s path- failover issues on software initiator iSCSI storage.• Designed and implemented storage device/patch claiming rules to enhance the flexibility for ESX server to manage its devices/patches. Successfully handling dynamically the conflicting requirement from EMC and Dell-Equallogic for their management LUNs just by a configuration option change for their devices/patches in ESX server. Device/patch claiming rule mechanism has become a major feature in ESX4.0.x to enable major vendors’ storage plug-in modules in ESX environment – EMC’s storage multipath module (EMC’s Powerpath) is loading at the same time with VMWare in-house storage multipath module; device/path claim rules to determine which storage module to claim the paths to a particular storage LUN. From February 2007 to May 2011 (4 years 4 months) Palo Alto, CASenior Software Engineer @ Silicon Image LINUX OS / MAC OS X kernel development, multi-threaded programming, SCSI/SATA, PCI/PCI-X/PCI-EXP, RAID (0,1,10,5, N-Way Mirror) device driver development for SATA storage products including SteelVine product line.· Architected and deployed a monotonic code-base (SCSI) driver architecture, a single driver source tree, to control all Silicon-Image’s PCI, PCI-X, PCI-Express to S-ATA Host Bus Adapters (HBA), supporting multiple Operating Systems (WINDOWS, LINUX, Netware, MAC OS X), on multiple platforms (32-64 bit architectures of Intel, AMD, and Power PC). This resulted in great cost reduction of software developing, testing; alleviated product releases managing. This architecture helped to re-use driver codes and software components for different platforms and OS, shorten software developing cycles. Products: Sil3112, Sil3114, Sil3124, Sil3131, Sil3132.· More ... Original FW developer of the SATA port-multiplier named SiliconImage-SteelVine. From May 2004 to February 2007 (2 years 10 months) Senior Software Developer @ Pacific Digital Corporation _____ILLEGAL_CHAR_SSSTART149ILLEGAL_CHAR_EEEND_____ Software architecture/leader of PDC's Parallel ATA/Serial ATA RAID (0, 1, 10) storage controller products. Managed daily activities and leaded a group of four software engineers to successfully delivered the beta shipment on tight schedule._____ILLEGAL_CHAR_SSSTART149ILLEGAL_CHAR_EEEND_____ LINUX/FreeBSD Device Driver Developer for PDC's Universal Host Bus Adapter (UHBA) to control Parallel ATA/Serial ATA drives. Successfully delivered high performance, high reliability RAID SCSI subsystem device drivers._____ILLEGAL_CHAR_SSSTART149ILLEGAL_CHAR_EEEND_____ Improvised software solutions/workarounds to hardware problems. With strong understanding of hardware and integration issues, I have been contributing a number of algorithms and methods to both hardware and software to roll out a high performance, high level of flexibility SATA RAID storage controller products._____ILLEGAL_CHAR_SSSTART149ILLEGAL_CHAR_EEEND_____ Technical lead of PDC_____ILLEGAL_CHAR_SSSTART146ILLEGAL_CHAR_EEEND_____s software group and offshore-software-group (SPSoft Inc. in India) From January 2003 to May 2004 (1 year 5 months) Irvine, CASenior Software Developer @ 3PAR IPO 2007 PAR Kernel/System Manager/FiberChannel Device Drivers developer for 3PARdata network storage system. I was employee 34th.• Created detailed algorithm to eliminate data inconsistencies, prevent data loss, and perform data recovery under uncontrolled shutdowns, along with data storage managment features. This resulted in successful customer critical quality tests for Beta and General Availability shipments.• Established and deployed infrastructure for communication interface protocols, such as between BIOS and 3PAR controller node kernel software using SMI, and I2C protocols; between User and Kernel of LINUX OS using IOCTL; between clients and the 3PAR storage box using TCP/IP with buffer size change on demand; between the JBOD chassis and the System Manager software. • Designed and rolled out new LINUX kernel services contain monitoring and reporting the system‘s environment, logging and enabling proactive and predictive customer support actions; From February 2000 to December 2002 (2 years 11 months) Fremont, CASoftware Developer @ Accrue Software, Inc. IPO in 07/1999 ACRU Server developer for _____ILLEGAL_CHAR_SSSTART147ILLEGAL_CHAR_EEEND_____Accrue Insight_____ILLEGAL_CHAR_SSSTART147ILLEGAL_CHAR_EEEND_____ product:_____ILLEGAL_CHAR_SSSTART149ILLEGAL_CHAR_EEEND_____ Developed and implemented DAS (Database Access Servlet) for _____ILLEGAL_CHAR_SSSTART147ILLEGAL_CHAR_EEEND_____Accrue Insight_____ILLEGAL_CHAR_SSSTART147ILLEGAL_CHAR_EEEND_____ product. Successfully provided a set of Application Protocol Interface (API) that allowed UI layer to connect to Accrue Warehouse._____ILLEGAL_CHAR_SSSTART149ILLEGAL_CHAR_EEEND_____ Restructured Accrue's data Analyzer component, a core of _____ILLEGAL_CHAR_SSSTART147ILLEGAL_CHAR_EEEND_____Accrue Insight_____ILLEGAL_CHAR_SSSTART147ILLEGAL_CHAR_EEEND_____ product that parsed and transformed web data to database schema-formats automatically._____ILLEGAL_CHAR_SSSTART149ILLEGAL_CHAR_EEEND_____ Improved loading data rate into Accrue Warehouse (ORACLE / REDBRICK Databases) by 20% by optimizing the loader component. _____ILLEGAL_CHAR_SSSTART149ILLEGAL_CHAR_EEEND_____ Established a security protocol using MD5 and RC4 algorithms to safely transfer data from Collectors (remote web data collectors) to Accrue Warehouse.Setup entire environment to facilitate _____ILLEGAL_CHAR_SSSTART147ILLEGAL_CHAR_EEEND_____Accrue Insight_____ILLEGAL_CHAR_SSSTART147ILLEGAL_CHAR_EEEND_____ product release. Determined product-build cycles. From May 1997 to January 2000 (2 years 9 months) Senior Engineering Manager (Hiring Big Data Platform Engineers!) @ Branch Branch powers mobile growth for 50,000+ of the most advanced apps in the world, including household names like Airbnb, Buzzfeed, Twitch, Under Armour, and many more. Every day, our customers generate more than 100 million new URLs, and we process 2.5TB of raw data from over 10 billion events.Branch is already transforming how brands and users interact across digital platforms with our mobile marketing and deep linking solutions, and we are just getting started. We’ve raised more than $330M from investors such as NEA, Australian Future Fund, Founders Fund and Playground Ventures. If this sounds exciting and you’re looking for a challenge, come work with me! Check out our open roles: https://branch.io/careers/
Uber
Engineering Manager II
October 2018 to December 2019
San Francisco / Palo Alto
Uber
Sr. Software Engineer II
January 2016 to October 2018
San Francisco / Palo Alto Bay Area
Software Engineering Manager
October 2013 to January 2016
San Francisco Bay Area
Staff Software Engineer
October 2013 to January 2016
San Francisco Bay Area
Sr. Software engineer
September 2012 to October 2013
San Francisco Bay Area
Zynga
System Engineering manager
May 2011 to September 2012
San Francisco Bay Area
VMware, Inc
Staff Software Engineer
February 2007 to May 2011
Palo Alto, CA
Silicon Image
Senior Software Engineer
May 2004 to February 2007
Pacific Digital Corporation
Senior Software Developer
January 2003 to May 2004
Irvine, CA
3PAR IPO 2007 PAR
Senior Software Developer
February 2000 to December 2002
Fremont, CA
Accrue Software, Inc. IPO in 07/1999 ACRU
Software Developer
May 1997 to January 2000
Branch
Senior Engineering Manager (Hiring Big Data Platform Engineers!)
What company does Diepvu Le work for?
Diepvu Le works for Uber
What is Diepvu Le's role at Uber?
Diepvu Le is Engineering Manager II
What industry does Diepvu Le work in?
Diepvu Le works in the Computer Software industry.
Who are Diepvu Le's colleagues?
Diepvu Le's colleagues are Bradley Fernando, Cynthia Chiang, Jason LI, Swapnil Debarshi, Lila Strominger, Dingjie Yang, Mafalda Barros, KARTHIKEYAN DEVENDRAN, Rumeesa Khalid, and Fatima Aliyeva
Enjoy unlimited access and discover candidates outside of LinkedIn
One billion email addresses and counting
Everything you need to engage with more prospects.
ContactOut is used by
76% of Fortune 500 companies
Diepvu Le's Social Media Links
/school/ha... /company/b...