Consulting Opportunities with Big Data and Hadoop
Gayn B. Winters, Ph.D.
1962 Lerner Lane
Santa Ana, CA 92705
Petabyte (10^15 bytes) and larger datasets are becoming common, as are techniques for processing them on clusters of thousands of ordinary PCs. This talk discusses the techniques provided by The Apache Software Foundation’s Hadoop, which enable high-availability processing on such clusters, where machine and disk failures are frequent and expected. The techniques will be presented along with several simple examples and at least one substantial example. Once the basic principles are understood, consulting opportunities abound as more and more companies realize they have huge amounts of complex data. For example, the financial markets have huge datasets related to the risks of various investments. Many companies offer their products online, and as a prospect logs into their site, the company wants to make meaningful product suggestions (think Amazon for books, Netflix for movies, Fry’s for electronics, Dell and HP for PCs, etc.). GM just cancelled its Facebook advertising for lack of productivity; GM has tons of consumer data, so what did it do wrong on Facebook, and how should it proceed? Grocery stores give discounts for using “membership cards.” They collect petabytes of information yearly that correlate the shopper’s age, address, and purchases. How should they use this data? Power transmission companies have amassed huge quantities of usage and load data across the current grid. How should they use this to design the next-generation “smart grid”? Electronic medical records can now be mined to discover correlations between various sources of pollution and certain diseases. Google builds all of its applications using this technology; what would you suggest for Yahoo and other specialized search engines? There will be time for Q&A on these and related consulting opportunities.
Ph.D. from MIT, with considerable postdoctoral and academic work. Did foundational work using distributed workstations in the financial analysis industry. At Digital Equipment, built the industry’s first PC with integral Ethernet; won a Computerworld award for best micro/mainframe integration software (which became Digital’s highest-volume software product); was promoted to Digital’s corporate strategy board and to Corporate Consultant; served as corporate internationalization architect; and oversaw multiple worldwide programs with many F1000 clients. Served on the Board of Directors of Unicode, Inc. At Phoenix Technology, as CTO, VP, and GM, ran worldwide engineering and program management for systems; clients were manufacturers of PCs, boards, chips, peripherals, PDAs, and laptops. Started Bristol Systems to do engineering and application development outsourcing. Expert in high-availability networking and security. Has provided consulting, software development, and project/program management for manufacturing, commercial real estate, legal, and laboratory clients. At L-3/IEC, worked on current and modernized military GPS-based products. Currently working on Big Data problems in the insurance industry.
Date: Saturday, June 23rd, 2012
Time: 10:00 AM Introductions & Presentation
12:00 PM Lunch & Networking
Location: Cal State Fullerton on Nutwood and 57 in Fullerton.
The meeting is in the Engineering and Computer Science (ECS) Building, Room 202.
Parking: Saturday parking is free. Enter from Nutwood and take Folino Drive north to Parking Lot E. See maps at parking.fullerton.edu.maps for more detail.
Future Meeting: A presentation by Dr. Gary L. Blank, consultant, workshop leader, and 2012 candidate for IEEE-USA President-Elect, is being rescheduled for an August date. Watch for the announcement.
Lunch is free with reservation. RSVP before the 23rd with an email to firstname.lastname@example.org.
Check out the IEEE-USA consultants website, database, and resources at www.ieeeusa.org/business.