Berlin 2018
Flink Forward speakers are experts from global companies like Airbnb, Alibaba, Amazon, eBay, Here, IBM, ING, Lyft, Microsoft, Netflix, Rovio, Yelp, Uber, and many more, who have built scalable streaming infrastructure and enterprise-grade applications.
Hear why and how they use Flink as the stream processing engine of choice for large-scale stateful applications, including real-time analytics, real-time search and content ranking, and fraud/anomaly/threat detection.
Ververica
CEO, Co-founder at Ververica
Kostas Tzoumas is a co-founder and CEO of Ververica, the company founded by the original creators of Apache Flink. Kostas is a PMC member of Apache Flink and earned a PhD in Computer Science from Aalborg University, followed by postdoctoral experience at TU Berlin. He is the author of a number of technical papers and blog articles on stream processing and other data science topics.
Ververica
Co-founder, CTO at Ververica
Stephan Ewen is CTO and co-founder at Ververica where he leads the development of the stream processing platform based on open source Apache Flink. He is also a PMC member and one of the original creators of Apache Flink. Before working on Apache Flink, Stephan worked on in-memory databases, query optimization, and distributed systems. He holds a Ph.D. from the Berlin University of Technology.
Unlocking the next wave of applications with Stream Processing
Alibaba
Senior Director at Alibaba
Xiaowei Jiang is a Senior Director at Alibaba. He leads the StreamCompute Platform for AliCloud. The platform provides real-time data processing both internally and externally. Previously, he worked as a Tech Lead at Facebook and a Principal Engineer at Microsoft SQL Server.
Unified Engine for Data Processing and AI
Ververica
Software Engineer at Ververica
Till is a PMC member of Apache Flink and software engineer at
A Year in Flink
Ververica
Tech Team Lead at Ververica
Aljoscha Krettek is a PMC member at Apache Flink and co-founder and software engineer at “
A Year in Flink
Lightbend
Deputy CTO at Lightbend
Viktor Klang is a problem solver, software developer, prolific contributor to the Akka project, Akka Tech Lead Emeritus, co-founding member of the Reactive Streams Special Interest Group, and contributor to the Scala Standard Library APIs like Scala Future & Promise.
The convergence of stream processing and microservice architecture
Microsoft
Group Architect at Microsoft
Avihai is an architect in the Microsoft Cloud App Security group, where he leads the development of strategic infrastructure changes and major product tracks. His main focus is designing and building applications at mass scale.
TNG Technology Consulting
Software Consultant at TNG Technology Consulting
Max works with TNG Technology Consulting, a Munich-based firm focused on high-end IT projects. He is currently on a project with one of Europe’s largest telecommunications providers, where he is heading up the SRE team supporting a platform built for large-scale anonymization and hence focusing on all things automation and observability. In his free time he enjoys juggling and volleyball.
Ververica
Software Engineer at Ververica
Dawid Wysakowicz is a Flink committer, currently working as a Software Engineer at Ververica. Recently, his main area of interest has been detecting patterns in streams of data with Flink’s Complex Event Processing library. He previously worked at GetInData, where he implemented real-time streaming solutions based on Apache Flink. His journey with highly distributed and scalable solutions started in 2015 while writing
Ververica
Software Engineer at Ververica
Timo Walther is a committer and PMC member of the Apache Flink project. He studied Computer Science at TU Berlin. Alongside his studies, he participated in the Database Systems and Information Management Group there and worked at IBM Germany. Timo works as a software engineer at Ververica. In Flink, he is mainly working on the Table & SQL API.
Netflix
Senior Software Engineer at Netflix
Steven Wu is a software engineer at Netflix. He works on the real-time data infrastructure that powers Netflix’s massive data ingestion pipeline and stream processing platform. Previously, he worked on the cloud platform that forms the foundation of Netflix’s cloud-native microservice architecture. He is passionate about building scalable distributed systems and empowering people with data.
Uber
Senior Software Engineer at Uber
Amey is a Senior Software Engineer on Uber’s Marketplace Data Intelligence team, where he works on the stateful streaming and geo-spatial data systems that power applications ranging from health monitoring and forecasting to dynamic pricing within Uber’s ride-sharing marketplace. He has been dealing with thorny issues around streaming pipelines and state ever since he started his career working on Yahoo’s ad tech systems in 2011, when Apache Pig and Storm were the state of the art. He holds a B.S./M.S. in Electrical and Computer Engineering from the University of Illinois at Urbana-Champaign.
Threading Needles in a Haystack: Sessionizing the Uber firehose in realtime
Yelp
Software Engineer at Yelp
Vipul Singh is a software engineer at Yelp Inc. He is part of the Distributed Systems team, where he develops and maintains stream processing applications. His current focus is on building data connectors, enabling the creation of a data lake from schematized streams, and developing next-gen stream processing infrastructure at Yelp.
Alibaba
Senior Staff Engineer at Alibaba
Feng Wang is the head of the real-time computing engine team at Alibaba.
Airbnb
Software Engineer at Airbnb
Brian Wolfe is a software engineer on the Observability team at Airbnb. While working on Observability, he helped create several streaming pipelines to perform custom processing and deep introspection of performance and errors in production infrastructure. This work has saved hundreds of hours of engineering time spent diagnosing and remediating issues in production. Before working at Airbnb, he developed software for home robotics and algorithms for designing biomolecules.
SK Telecom
Manager at SK Telecom
Dongwon Kim is a big data architect at SK telecom. During his post-doctoral work, he was fascinated by the internal architecture of Flink and gave a talk titled “a comparative performance evaluation of Flink” at Flink Forward 2015. He introduced Flink to SK telecom, SK energy, and SK hynix to meet these companies’ various needs for real-time stream processing. Last year, at Flink Forward 2017 Berlin, he shared his experience of using Flink to build a solution for Predictive Maintenance. He has recently been using Flink to calculate the driving scores of millions of users in real time.
Netflix
Senior Data Engineer at Netflix
Shirya works on the Data Engineering team for Personalization, which, among other things, delivers recommendations made for each user. The team is responsible for the data that goes into training and scoring the various machine learning models that power the Netflix homepage. They have been working on moving some of Netflix’s core datasets from a once-a-day batch ETL to near-real-time processing with Apache Flink. Before Netflix, she was at Walmart Labs, where she helped build and architect the new-generation item setup, moving from batch processing to streaming. They used Storm and Kafka to enable a microservices architecture that allows products to be updated in near real time, as opposed to the once-a-day updates of the legacy framework.
Dell EMC
Senior Software Engineer at Dell EMC
Raúl Gracia-Tinedo is a senior software engineer at DellEMC working on Pravega, a novel distributed storage system for data streams. Prior to joining DellEMC, he worked as a postdoc in the context of European research projects (FP7 CloudSpaces, H2020 IOStack) and as an intern at IBM Research and Tel-Aviv University. He holds a Ph.D. in Computer Engineering (2015, outstanding thesis award) from Universitat Rovira i Virgili (Spain). Raúl is a highly motivated researcher and engineer interested in distributed systems, cloud storage, and data analytics, with more than 20 papers.
ING
Senior Full Stack Engineer at ING
Currently working as a Senior Full Stack Engineer at ING’s Wholesale Banking Advanced Analytics. With a background in software development and consultancy, he helps organizations validate and flesh out new digital opportunities. His passion and craftsmanship are fed by the belief that there is still lots of room for improvement in software engineering. His fields of interest span all aspects of software architecture, software engineering, and cloud-based engineering.
Shandong University
PhD Candidate at Shandong University
Xingcan Cui, who is interested in database and stream processing, is a committer of the Apache Flink project. He has just finished his Ph.D. study under the supervision of Prof. Xiaohui Yu at Shandong University, China and will continue his research as a postdoc at York University, Canada.
TU Berlin
Research Associate at TU Berlin
Jonas is a Research Associate at TU Berlin and a PhD candidate supervised by Volker Markl. His research interests include data stream processing, sensor data analysis, and data acquisition from sensor nodes. All his publications are available on http://www.user.tu-berlin.de/powibol/. He wrote his master’s thesis during a year abroad at the Royal Institute of Technology (KTH) and the Swedish Institute of Computer Science (SICS) / RISE in Stockholm, under the supervision of Seif Haridi and Volker Markl and advised by Paris Carbone and Asterios Katsifodimos. He graduated with an M.Sc. in computer science from TU Berlin in April 2015. Prior to that, he received his B.Sc. degree from Baden-Württemberg Cooperative State University (DHBW Stuttgart) and worked for several years at IBM in Germany and the USA. He is a participant of Software Campus and an alumnus of Studienstiftung des deutschen Volkes and Deutschlandstipendium.
KTH Royal Institute of Technology in Stockholm
Senior Researcher at KTH Royal Institute of Technology in Stockholm
Paris Carbone is a Flink committer and a senior computer scientist working at the intersection of distributed systems, data management and programming systems. Paris is currently the tech lead of the ‘Continuous Deep Analytics’ project at KTH and RISE SICS in Sweden, investigating how intermediate programming languages and hardware acceleration will make data streaming the dominant end-to-end architecture for critical and complex decision making. At night, you can catch Paris performing with his jazz quintet in the oldest neighbourhoods of Stockholm.
Stream Loops on Flink: Reinventing the wheel for the streaming era
Europace AG
Open Source Strategist at Europace AG
Isabel Drost-Fromm is Open Source Strategist at Europace AG, Germany. She is a member of the Apache Software Foundation and a co-founder of Apache Mahout, and has mentored several incubating projects. Isabel is interested in all things FOSS, search and text mining, and has a solid machine learning background. True to the nature of people living in Berlin, she loves having friends fly in for a brief visit. As a result, she co-founded and is still one of the creative heads behind Berlin Buzzwords, a tech conference on all things search, scale and storage.
Centrum Bezpieczeństwa Cyfrowego S.A.
Principal Streaming Architect at Centrum Bezpieczeństwa Cyfrowego S.A.
Sebastian Czarnota is a Principal Streaming Architect at Digital Fingerprints (Centrum Bezpieczeństwa Cyfrowego S.A.), a Polish cybersecurity startup working on a continuous authorisation solution (aka behavioural biometrics) for the financial sector. Sebastian is responsible for both system architecture and data flow design. His experience includes working on the acquisition, analysis and presentation of sec.gov data, as well as research on algorithmic trading based on fundamental data retrieved from that source. Sebastian also worked at Samsung R&D on a secure bootloader project. He graduated with distinction from the Military University of Technology, specialising in cryptology. He loves feeding Flink with Acorns.
Lyft
Software Engineer at Lyft
Thomas is a Software Engineer, Streaming Platform, at Lyft, working with Apache Flink. Earlier he worked at a number of other technology companies in the San Francisco Bay Area, including DataTorrent, where he was a co-founder of the Apex project. Thomas is Apache Apex PMC Chair, a committer to Apache Beam, and has contributed to several other ASF ecosystem projects. He has also presented at international big data conferences and is the author of the book “Learning Apache Apex”.
GoJek
Product Engineer at GoJek
Rohil Surana works at Go-Jek as a Product Engineer in the Data Engineering team. He has been solving data streaming and data warehousing problems at Go-Jek. He prefers a hands-on approach to solving problems while learning at the same time. He loves to travel, learn about different places, and try their food.
RISE/KTH
Research Intern at RISE/KTH
Tobias previously worked as an IT consultant for Internet of Things at IBM before he started a pan-European Master’s program in Data Science
Appier
Software Engineer at Appier
Wei-Che (Tony) Wei is a software engineer on the Data Platform team at Appier. He works on providing general facilities that let internal users access data, leveraging open-source projects such as Flink, Spark and Kafka. Recently, he has focused on building a streaming platform that lets users benefit from the advantages of a stateful streaming framework. He has also been contributing to Flink.
Lessons learned from Migrating to a Stateful Streaming Framework
Amazon
Principal Engineer at Amazon
Suneel is a member of the Apache Software Foundation and a PMC member of Apache OpenNLP, Apache Mahout and Apache Streams. He has presented at Hadoop Summit, Apache Big Data, Flink Forward, Berlin Buzzwords, and Big Data Tech Warsaw. He is a Principal Engineer at Amazon Web Services.
Streaming topic model training and inference with Apache Flink
Intellify Learning
Chief Architect at Intellify Learning
Jared is a passionate expert in Software Architecture, Continuous Delivery, Platform-as-a-Service systems, and AWS. As Chief Architect of Intellify, Jared led the design and implementation of the cloud-based streaming analytics platform. Previously, he was a Staff Software Engineer at HubSpot, responsible for migrating their content management system from a Python-based monolith to Java microservices. He has worked in various industries including Lending, Search, Ad-tech, and Marketing.
GoJek
Data Engineer at GoJek
Ravi Suhag works on the Data Engineering team at GoJek which is responsible for handling data infrastructure for all GoJek Products. To know more about the speaker please visit: www.ravisuhag.com
TU Berlin
PhD Candidate at TU Berlin
Niklas is doing his PhD with Prof. Anja Feldmann (TU Berlin). During his master’s degree he became intrigued by the mechanics of large-scale networks. In his research he investigates ways to speed up the analysis of network traces and develops optimizations for stream processors. He participated in the research project Berlin Big Data Center (bbdc.berlin) and has supervised multiple bachelor’s and master’s theses related to Apache Flink.
Orange Polska
PhD Candidate at Orange Polska
Piotr Wawrzyniak has been a Project Manager at the Orange Polska R&D Centre since 2011. He is currently pursuing a Ph.D. degree in electronics at the Institute of Electronics, Lodz University of Technology, Lodz, Poland. His research interests include the development of innovative telecommunication services, stream mining and Big Data architecture design. He also focuses on software development, in particular for the Apache Hadoop ecosystem. Piotr holds a PRINCE2 Practitioner certificate in project management. He is a member of the Polish Information Processing Society and a PPMC member of the Apache SAMOA project.
IBM
Software Engineer at IBM
Edward Rojas has been a Software Engineer at IBM since 2015. He has worked mainly in the Hybrid Cloud division on different projects and is currently part of a team in charge of building a product on top of Kubernetes that leverages Apache Flink for event stream processing.
ING
IT Engineer at ING
Olga Slenders (Reznik) has been a software engineer for more than 10 years. She has enjoyed working with a range of programming languages, and is now having fun developing in Scala. She joined ING Bank Netherlands in 2014 after finishing a data reverse engineering thesis project. Currently, Olga is a part of the team that develops a streaming platform for analytics fraud detection using Apache Flink.
New Relic
Software Engineer at New Relic
Caito is pursuing a Masters in Software Engineering at Harvard University and she
Imperial College
Ph.D. Student at Imperial College
George is a Ph.D. student in the Large-Scale Distributed Systems (LSDS) group at Imperial College London, under the supervision of Dr. Peter Pietzuch. His Ph.D. is supported by a CDT HiPEDS scholarship. Prior to this, he was an undergraduate student in the Electrical and Computer Engineering department of the National Technical University of Athens and conducted his thesis in affiliation with CSLab.
Trackunit A/S
System Architect at Trackunit A/S
Lasse Nedergaard is a System Architect at Trackunit. More details at https://www.linkedin.com/in/lassenedergaard/
GetInData
Big Data Architect, Co-founder at GetInData
Krzysztof is an architect, engineer and researcher of solutions that take advantage of Big Data technologies, such as advanced analytics, decision automation systems and recommendation engines. He is a Big Data geek who has worked with those technologies, HPC, distributed systems and machine learning for over 7 years, previously at companies like Netezza/IBM and Hadapt/Teradata, and now as a member of GetInData’s team of experts. He likes to work full-stack, from architecting a solution through engineering down to installing, troubleshooting and monitoring. He now specializes in scalable real-time analytics solutions.
Criteo
Staff Software Engineer at Criteo
Staff Software Engineer at Criteo, currently technical lead of the Invalid Traffic detection team. Worked for Grammarly in the past. Likes the JVM and functional programming. Fan of improving development productivity.
Data lossless event time streaming processing for revenue calculation
Walmart Labs
Principal Data Engineer at Walmart Labs
Andrew Torson is a Principal Data Engineer in the Smart Pricing team within the Walmart Labs organization. His current work is focused on big-fast-data pipelines in Flink and Spark for the Walmart e-commerce business, leveraging ML-based retail pricing algorithms. Before joining Walmart Labs, Andrew worked as a data scientist and data engineer on a handful of IoT projects in the area of mobile robotics (warehousing, manufacturing, and marine container terminal industries), using ML-based tools in Scala, Python and Java. Andrew holds a PhD in Operations Management from NYU and worked as an ML scientist at Siemens Research/Labs after his graduation, which led him towards his current product data engineering track.
Using a sharded Akka distributed data cache as a Flink pipelines integration buffer
ING
Data Engineer at ING
After a bachelor’s in Computer Science and a master’s in Artificial Intelligence, I started working at ING roughly two and a half years ago. Following a one-year IT traineeship, I joined my current team, Wholesale Banking Advanced Analytics. Here I work as a Data Engineer for project Katana. Katana is aimed at aiding traders in Financial Markets, which involves lots of real-time streaming systems, one of which obviously is Apache Flink! We’ve been using Flink for nearly a year now, which has resulted in open sourcing a deployer on Kubernetes and implementing our own state manager using Apache Avro.
German Research Centre for Artificial Intelligence
Research Assistant at German Research Centre for Artificial Intelligence
Philipp is a computer science Master’s student at the Technische Universität Berlin, specializing in big data analytics systems. Alongside his studies, he has worked for several companies and gained experience in frontend and backend software development. At the German Research Center for Artificial Intelligence, he joined a streaming-systems research project involving Apache Flink as a research assistant.
Mesosphere
Technical Lead Community Projects at Mesosphere
Jörg is the technical lead for community projects at Mesosphere in San Francisco. In his previous life he implemented distributed and in-memory databases and conducted research in the Hadoop and Cloud area during his PhD. His speaking experience includes various meetups, international conferences, and lecture halls.
Orange Polska S.A.
R&D Expert at Orange Polska S.A.
Jarosław Legierski received his M.Sc. in electronics and telecommunication and his PhD degree in electronics from the Technical University of Lodz. Since 1998, he has worked in the telecommunications industry. He is currently an R&D Expert in the Research and Development Center, Orange Labs at Orange Polska, and an assistant professor at the Faculty of Mathematics and Information Science of Warsaw University of Technology. Jarosław Legierski is the co-creator of the Open Middleware 2.0 Community (www.openmiddleware.pl). His research interests include open application programming interfaces (APIs), Open (Big) Data, and next-generation telecommunication services. He is the author of publications on APIs and Open Data.
GoJek
Data Engineer at GoJek
Sumanth works on the Data Engineering team at GoJek which is responsible for handling data infrastructure for all GoJek Products.
GoJek
Data Engineer at GoJek
Prakhar Mathur completed his bachelor’s degree at the Indian Institute of Technology, Jodhpur. He is currently working at GO-JEK as a Product Engineer with the Data Engineering team, solving problems around data publishing and making data easily available to the organisation.
Kcell
Technical Lead at Kcell
Ten years of experience in software development at one of the largest telecom operators in Kazakhstan. Interested in high-load and big data projects. Technical lead for real-time event processing and data lake projects at Kcell.
Microsoft
Data Science Team Leader at Microsoft
Yonatan holds an M.Sc. in theoretical physics, and currently leads the Data Science team for Microsoft Cloud App Security, utilizing Machine Learning tools to detect anomalies in user activity in the cloud.
ING
Software Engineer at ING
Gijsbert van Vliet has been a software engineer for 3 years and has a background in mathematics. He started off as a Java developer and now works mainly with Scala. He started his career at ING Bank Netherlands. Last year he joined the team that develops a streaming data platform using Apache Flink, which is used for multiple use cases.
New Relic
Software Engineer at New Relic
Nikolas has worked in both the computer and software engineering fields. For a start, he is a father, tinkerer, and cyclist. In his latest adventure at New Relic, he worked to prototype various solutions in order to develop an internal method for aggregating customer product usage metrics, which are ultimately used for billing purposes. The stream processing system leverages Apache Flink to process millions of messages per minute.
Databricks
Software Engineer at Databricks
Joey Frazee is a Solutions Architect at Databricks, an Apache Software Foundation member, and contributor to the Apache Streams and Apache NiFi projects. He was previously a graduate student in statistics and linguistics at the University of Texas, a data scientist and director of engineering at People Pattern, and IoT specialist at Hortonworks. He has presented at Flink Forward, Big Data Warsaw, and elsewhere.
Streaming topic model training and inference with Apache Flink
Ververica
Software Engineer at Ververica
Stefan is an Apache Flink
Ververica
Software Engineer at Ververica
Tzu-Li (Gordon) Tai is an Apache Flink PMC member and software engineer at
Flumaion Ltd
Independent Consultant at Flumaion Ltd
Raj has over 20 years’ experience in Investment Banking. Raj has worked primarily in the Fixed Income business at HSBC, J P Morgan and Deutsche Bank. He started his career in FX Options at Citibank. Raj has a PhD in Engineering from the University of Newcastle upon Tyne and holds a certificate in Quantitative Finance from the CQF Institute in London.
Rovio
Director, Data Engineering at Rovio
Henri Heiskanen works as Director of Data Engineering in Rovio Entertainment Corp, a major entertainment media company and creator of the globally successful Angry Birds franchise. He has 19 years of experience in software development for mobile, telecommunications and entertainment industries.
HERE Technologies
Robin Slomkowski at HERE Technologies
American living in Berlin, working with distributed systems since 1992, alternating between technical operations and software engineering. These days focusing on translating business needs into technology implementations both in developing HERE’s platform and helping HERE’s customers.
King
Director of Engineering at King
Together with his team, Vladimír develops King’s streaming platform RBEA. Previously, Vladimír helped create the technology of two startups, Omniata (acquired by King in 2017) and RM5 Software (acquired by Efecte in 2014). After over 10 years in engineering roles, his main technical expertise lies in big data processing, streaming and identity management.
Ververica
Software Engineer at Ververica
Gary Yao is a Software Engineer at
Ververica
Software Engineer at Ververica
Nico Kruber is an Apache Flink contributor and works as a software engineer at
Improving throughput and latency with Flink’s network stack
Ververica
Co-founder, Software Engineer at Ververica
Robert Metzger is a PMC member of the Apache Flink project and a co-founder and an engineering lead at Ververica. He is the author of many Flink components including the Kafka and YARN connectors. Robert studied Computer Science at TU Berlin and worked at IBM Germany and at the IBM Almaden Research Center in San Jose. He is a frequent speaker at conferences such as the Hadoop Summit, ApacheCon and meetups around the world.
data Artisans Platform: Enterprise-Ready Stream Processing with Apache Flink
Ververica
Software Engineer at Ververica
Igal Shilman is a Software Engineer at
Open-Source Software Engineer
Open-Source Software Engineer at Open-Source Software Engineer
Max is an independent software engineer and PMC member of Apache Flink and Apache Beam. During his studies at the Free University of Berlin and Istanbul University, he worked at Zuse Institute Berlin on Scalaris, a distributed transactional database. Inspired by the principles of distributed systems and open source, he helped to develop Apache Flink at dataArtisans and, in the course of that work, joined the Apache Beam community. After maintaining the SQL layer of the distributed database CrateDB, he is now working on the cross-language portability aspects of Apache Beam.
Google Cloud
Software Engineer at Google Cloud
Robert Bradshaw is a software engineer at Google, developing tools for petabyte-scale data processing, most recently working on Apache Beam. He is also active in the open source community, leading the Cython project since its inception and as a long-time contributor to the open source mathematics software Sage. He received a Ph.D. in Mathematics from the University of Washington and currently resides in Stockholm, Sweden.
Ververica
Training Coordinator at Ververica
David is responsible for train
Ververica
Solutions Architect at Ververica
As a Solution Architect at
Ververica
Co-founder, Software Engineer at Ververica
Fabian Hueske is a committer and PMC member of the Apache Flink® project and has been contributing to Flink since its earliest days. Fabian is a co-founder of
Ververica
Software Engineer at Ververica
Kostas is a Flink Committer, currently working with
Apache Flink, Flink and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event.