Jobs Posted on the Whova Community Board of International Conference on Very Large Data Bases
If you know anyone in the job market, feel free to share with them
Research Engineer - Data Systems
BorealisAI We’re looking for an enthusiastic Research Engineer with experience in data systems who’s excited by the opportunity of being at the forefront of machine learning technology, and working on extremely challenging problems in data discovery, data quality, and data integration. As a Research Engineer, you’ll be part of a collaborative team delivering AI projects end to end – everything from data pre-processing and exploration, to prototyping novel algorithmic solutions, to software implementations of machine learning-based products. You will function as a bridge between research and development teams to help extend research prototypes into implemented products.
Postdoc in Graph Management and Semantic Search at Aarhus University
Aarhus University One position in Graph Management and Semantic Search is available in the Data-Intensive Systems group, Department of Computer Science (https://cs.au.dk) at Aarhus University.
The position is part of a project with an important industrial stakeholder. The project aims at enabling semantic search on video sequences through Graph Databases, Machine Learning, and Data Mining methods. We are looking for a motivated and independent postdoctoral researcher to join our group.
The Postdoc position is offered initially for 1 year, with possibility of an extra year extension after mutual consent.
Software Engineer, Researcher and Internships on Encrypted Databases
Alibaba Group Responsible for developing encrypted database systems for Alibaba Cloud.
Developing core database components using secure programming methods and Intel SGX trusted execution environment.
Optimizing the performance of secured database operations, queries and transactions.
Improving the security of database systems using cryptography theories.
Researcher/Engineers on Graph Processing Systems (Research Intern for students are also welcome!)
Alibaba Group You are invited to join forces with us to build GraphScope!
GraphScope is a unified distributed graph computing platform that provides a one-stop environment for performing diverse graph operations on a cluster of computers through a user-friendly Python interface. GraphScope makes multi-staged processing of large-scale graph data on compute clusters simple by combining several important pieces of Alibaba technology: including GRAPE, MaxGraph, and Graph-Learn (GL) for analytics, interactive, and graph neural networks (GNN) computation, respectively, and the vineyard store that offers efficient in-memory data transfers.
See https://github.com/alibaba/graphscope to learn more. We will present GraphScope in VLDB2021 industry and demo track! Looking forward to meet you on VLDB2021!
Research Assistant/Associate in Health Data Science
Newcastle university We are recruiting for a Data Science postdoc to join our Health Data Science team at Newcastle University, as part of a collaboration between the School of Computing and the Faculty of Medical Sciences, as well as a large Pharma company.
**Closing Date: 17 August 2021** **full time and available on a fixed term basis until 31 October 2022. - extensible pending further funding.
The challenge is to extract clinically novel insights from the largest liver disease (NAFLD-NASH) registry in Europe, by deploying an array of data analytics, machine learning, and AI techniques to a rich clinical and multi-omics dataset. When "fat liver" develops into fibrosis (NASH), it results in high mortality rates. One of the aims of the project is to uncover novel non-invasive biomarkers for early detection of NASH on a personalised basis.
Software Engineer and Researcher On Time Series Database / Next-generation Distributed Relational Database
Huawei Technologies Co., Ltd. The Cloud Database Innovation Lab's mission is the research and development of cloud-native databases. We have developed the cloud-native relational database Taurus. With the seperation of compute and storage and log-is-database design in mind, it is also fully compatible with MySQL. We has also developed the global-scale, mutiple-model NoSQL database system Gemini, which is fully compatible with MongoDB and Cassandra. We will continue the journey in the cloud-native databases world, developing the time-serie and spatiotemporal database, the next-generation distributed database, and exploring the fusion of AI and DB.
Software Engineering MTS/SMTS/LMTS - Hyper Engine
Tableau Software, A Salesforce Company As a Software Engineer on the Hyper Database team, you will contribute to Salesforce's next-gen database technology Hyper, which powers all products of our Tableau Visual Analytics and Collaboration suite. Hyper is designed and built for high-performance transactional and analytical workloads in a diverse range of settings from scalable cloud environments to workstations and laptops running all major operating systems.
Together with a geographically distributed collaborative, smart and motivated team you will design and build impactful trustable solutions that are used in a variety of use cases, helping our customers perform interactive analytics of the freshest state of data across Tableau and Salesforce products.
Software Engineering MTS/SMTS/LMTS, Hyper Engine developer tools and CI/CD
Tableau Software, A Salesforce Company The primary focus of this position is to design and implement developer tools and the CI/CD pipelines for Hyper, our database powering all of Tableau’s products. Hyper is one of the most complex technical challenges here at Tableau. You will be challenged to leverage your creativity and experience to build the tools and development pipelines solving some of the most complex technical challenges in Tableau. Your work will have a significant impact across all of Tableau’s product offerings.
Tableau Software, A Salesforce Company Hyper Experiences brings the power of the Hyper Database, a next-gen high-performance analytics database, to the people. We find new ways to leverage its value across a variety of use cases, helping our customers perform interactive analytics of the freshest state of data across Tableau and Salesforce products. To deliver on this promise we build a healthy collaborative and geographically distributed team environment that empowers growth and execution. We put Customers front and center of everything we do, their trust and excitement are our goal. As we embrace the Cloud, it provides the opportunity to offer user experiences that solve our customers' biggest pain points around data volume, performance, and quality. We’re building microservices on Kubernetes to provide services and APIs to manage data pipelines, support partner integrations. All while keeping operational excellence high to ensure high availability. With your new perspective and drive, we’ll do more, smarter.
Aarhus University In the Data-Intensive Systems research group in the Department of Computer Science at Aarhus University, Denmark, we are currently looking for postdocs interested in efficient scalable algorithms, clustering algorithms or GPU parallel algorithms. If this sounds interesting, and you have published related work in top data management or data mining conferences, please reach out to Prof. Ira Assent, email@example.com
Internship or Research Visit, Hyper Engine
Tableau Software, A Salesforce Company As an Intern or Research Visitor on the Hyper Database team, you will contribute to Salesforce's next-gen database technology Hyper, which powers all products of our Tableau Visual Analytics and Collaboration suite. Hyper is designed and built for high-performance transactional and analytical workloads in a diverse range of settings from scalable cloud environments to workstations and laptops running all major operating systems. Together with a geographically distributed collaborative, smart, and motivated team you will design and build impactful trustable solutions that are used in a variety of use cases, helping our customers perform interactive analytics of the freshest state of data across Tableau and Salesforce products.
Asst. Professor in Data-intensive Systems
TU Delft We are searching for a strong researcher in a field related to one or more of the following: Data integration and knowledge management Scalable data processing Database systems
Ping me if you have questions!
Post-Doc Researcher (m/f/x) for Distributed Data Platforms
Dynatrace We are seeking for somebody with a Ph.D. in Computer Science or related fields to conduct research in the field of Distributed Data Platforms. Your exciting tasks will be to design the new architecture to address the massive data ingest, to handle data and to develop optimized algorithms. We need you to have a good feeling for performance because you will be contributing to the Global Research Community by publishing and disseminating research findings in journals and speaking at conferences. You will play an active part in scientific and industrial cooperation projects and collaborate with our Data Platform team and other internal and external stakeholders.
Lead Researcher (m/f/x) for Distributed Data Systems
Dynatrace We are looking for a Researcher to join our team as a Lead Researcher for Distributed Data Systems, to identify and develop new leading-edge technologies together with the research team and to define roadmaps and ensure consistency and timely delivery of results. You may be a great fit for our team if you have a Ph.D. in Computer Science and experience in a Postdoc role for several years of independent research. You must have a strong publication record of driving market-relevant research to contribute to the global Research Community. In this role you will mentor Ph.D. and master students and lead a team of researchers.
Senior Java Software Developer with focus on Big Data and High-Performance Computing (m/f/x)
Dynatrace Excited about High-Performance Computing with large quantities of data? Interested in joining a global leader that enables digital transformation? Looking for teammates who appreciate open communication and face challenges together as a team? Dynatrace is looking for a Senior Java Software Developer with focus on Big Data and High Performance Computing to develop and contribute to the Dynatrace Software Intelligent Solution in Java. You should have significant work experience developing in Java, including architectural design because you are going to design and implement the Dynatrace Platform independently.
Post-Doc Researcher (m/f/x) for Real-Time Data Analytics
Dynatrace Our research in the field of real-time data analytics is driven by the need to process and analyze millions of metrics and events. This requires high scalability not only of the infrastructure but also of the applied technologies and algorithms to handle data ingest and storage, and to enable large-scale data analysis and anomaly detection in real-time. Dynatrace is looking for a Post-Doc Researcher to conduct research in the field of real-time data analytics and develop efficient algorithms for real-time data processing and anomaly detection. You may be a great fit for the team if you have a Ph.D. in Computer Science or a related field and an expertise in the relevant research topics. We need you to have a good feeling for performance because you will be contributing to the Global Research Community by publishing and disseminating research findings in journals and speaking at conferences. You will play an active part in scientific and industrial cooperation projects and collaborate with our Data Science team and other internal and external stakeholders.
Azure Graph Database Engineers
LinkedIn The graph team builds and operates a novel distributed graph database which currently serves a 200B edges graph at 1.2 million QPS. We are developing an in-memory graph database in-house from the ground up. We also build and operate the distributed system that scales to the staggering size of the Economic Graph, while supporting all of the queries that power LinkedIn’s many products and core member experience.
Senior Rust Engineer (Query Engine)
Prisma At Prisma we are building the data layer for modern applications.
Below are some of the things you could expect to do as part of the Prisma team
Expand and improve the Core of the Prisma Query Engine and be part of creating a market-leading ORM in collaboration with our Open Source community.
Collaborate with a team of engaged engineers working on developing and improving Prisma Clients in multiple languages, such as Go and TypeScript.
Use your knowledge of databases and system architecture to create solutions that work for developers at all experience levels, making it easy for them to get the best out of their data.
Create well-tested and documented code that is easy to understand and contribute to by anyone in our community.
Two PhD positions in AI for Big Spatial Data
Roskilde University Department of People and Technology, Roskilde University, Denmark invites applications for two positions as PhD of Computer Science from January 1, 2022 or as soon as possible thereafter. Each PhD position is limited to a period of 3 years. The recruited PhD fellows will work on the research project AI-Powered Spatial Databases funded by Independent Research Fund Denmark. The project's overall goal is designing and developing machine learning based techniques for efficient and effective management of massive, heterogeneous and dynamic big spatial data. For the stipends, we are looking for PhD students who are qualified to undertake supervised independent research in the areas of big spatial data and applied machine learning, in particular on the research topics such as learned index for spatial data, machine learning facilitated spatial query processing, and trajectory pattern mining. *** Note: PhD fellows in Denmark are employees, not students of the conventional meaning.
Research Scientist, Systems and Infrastructure (PhD)
Facebook As a Research Scientist at Facebook, you will help build the systems behind Facebook's products, create web applications that reach millions of people, build high volume servers and be a part of a team that’s working to help connect people around the globe. The ideal candidate will have a keen interest in relevant engineering fields, such as (but not limited to) distributed software systems, storage systems, data warehousing and analytics, database systems, operating systems, networking systems, programming languages, compilers & runtime systems, security & privacy, cryptography, and mobile systems.
Software Engineer Intern, Systems and Infrastructure (PhD)
Facebook As a PhD intern at Facebook, you will help build the systems behind Facebook's products, create web applications that reach millions of people, build high volume servers and be a part of a team that’s working to help connect people around the globe.
The ideal candidate will have a keen interest in relevant engineering fields, such as (but not limited to) distributed software systems, storage systems, data warehousing and analytics, database systems, operating systems, networking systems, programming languages, compilers & runtime systems, security & privacy and mobile systems.
Sigma Computing About the Role:
We are rapidly growing and want you to join the world-class research team we are forming. Research Scientists at Sigma will develop novel methods and systems to improve every aspect of business intelligence and data exploration for all, shaping the future of Sigma as well as scalable interactive data analytics.
What you will be doing:
- Devise and execute a research agenda to improve the product. - Develop effective solutions to research problems with direct product impact. - Work with Sigma teams and customers to deploy research solutions to benefit the ever growing number of Sigma users. - Communicate research findings and developments, engaging with the broader community as a leader.
Qualifications we are looking for:
- PhD in Computer Science or a related field. - Track record of independent thinking and producing high impact research artifacts, including but not limited to publications, novel systems, and open source software. - Experience in one or more of Data Systems, Human-Computer Interaction, Information Retrieval, Visual Analytics, Machine Learning, Natural Language Processing, or Programming Languages. - Programming proficiency to rapidly turn research ideas into software prototypes.
We are looking for people that are excited to grow and constantly ask how we can do things better. If you are excited about the opportunity, we encourage you to apply even if you don’t satisfy 100% of the job requirements.
Assistant or Associate Professor
Utrecht University Utrecht University in the Netherlands is having a number of openings in multiple areas for tenure track Assistant or Associate professor. One of the openings is on Data Science with focus on Data and Information Management.
The application deadline is Aug 31st and the full text of the call can be found here: https://www.uu.nl/en/organisation/working-at-utrecht-university/jobs/assistant-or-associate-professors-in-information-and-computing-sciences-tenure-track-08-10-fte
Interested strong candidates are invited to apply.
Huawei Canada Research Centre Huawei Canada is seeking for engineers and researchers to expand and strengthen its Cloud R&D Business Unit. This is a great opportunity for the successful candidate to be part of an industry-leading team to build the next generation of cloud-native databases. The team has recently delivered its Taurus cloud-native database offering (SIGMOD 2019 https://dl.acm.org/doi/abs/10.1145/3318464.3386129). As part of a world class research team, you will have the opportunity to work as part of a small, high performance, and startup-like team to innovative, publish, develop patents, and drive our database-as-a-service solutions. The team works on a wide range of topics, including transactions, storage engines, SQL optimization, HA, scale-out architectures and so on. The ideal candidate will have excellent systems programming skills in C/C++, and be responsible for prototyping database engine features in the cloud, being up to date with latest developments from leading relevant conferences and publications, validating design and reviewing technical specifications and requirements. Bachelor degree in Computer Science/Engineering required; MS or PhD in Computer Science/Engineering highly preferred.
Firebolt Looking for C++ engineers to work on Firebolt's unique Cloud Data Warehouse engine. Many positions and details at https://www.firebolt.io/careers
Postdoc in Decentralized Systems
IST Austria https://twitter.com/LefKok/status/1427298469149089794
(Assistant/Full) Professor in Systems
IST Austria https://ist.ac.at/en/jobs/faculty/
Software developer / Researcher
Oracle Oracle’s commitment to R&D is a driving factor in the development of technologies that have kept Oracle at the forefront of the computer industry. If you are passionate about advanced development of next-generation large-scale distributed systems designed for the most popular database in the world and optimized for the cloud, we would like to talk with you.
Area of Expertise •System software, Computer architecture •Distributed systems & networking •Database kernel development •Query processing/big data analytics •Machine learning
Desired Skills and Experience •We are looking for candidates with a PhD or Master’s degree in Computer Science or Computer Engineering •Strong background in design and development •Demonstrated research excellence •Excellent interpersonal and communication skills
Postdoctoral Researcher in Database Engines, DB for ML, ML for DB at TU Berlin
TU Berlin The Berlin Institute for the Foundations of Learning and Data (BIFOLD, http://bifold.berlin) at TU Berlin conducts research in data management, machine learning, and its intersection. Currently, we have openings for Postdoctoral Researchers in the area of data management systems, including database engines, ML for systems, systems for ML, database performance, data mining, and analytics. We offer a world-class research environment and a diverse team of scientists, with a strong history of high impact research papers, software systems (e.g., Apache Flink), and technology transfer. Berlin is a vibrant city with one of the strongest science and startup ecosystems in Europe. For more details and application information contact Volker Markl at firstname.lastname@example.org.
PostDoc Position at HPI - Data Quality and AI
Hasso Plattner Institute (HPI) For a joint project with a law group, an ethics group and an industry partner, we are looking for a Postdoc researcher in the area of data quality / data cleaning / data preparation, preferably with some background in machine learning methods. The position is fully funded for a duration of 20 months, starting as soon as possible.
Developer Advocate at Toloka
Toloka Toloka is one of the world's largest crowdsourcing platforms for collecting and labeling large amounts of data for machine learning projects, ultimately solving a variety of problems for businesses. The data labeled on the platform is used to drive computer vision, speech recognition, search algorithms, recommendation systems, and more. Toloka is a key tool for major IT companies globally. We are looking for an experienced data evangelist with a passion for data science and a talent for distilling technical concepts into clear, concise, and engaging content. Since your audience will be ML engineers, developers, data engineers, and other tech professionals, your content will need to resonate with them. Responsibilities:
- build examples and blog posts showing how to solve common problems using our technology; - engage in conversations on GitHub, on Slack, and in user community meetups as well as at industry and academic conferences, roadshows, boot camps, hackathons, lectures, and elsewhere; - run community events; - develop programs to engage customers in the broader community; - participate in educational activities; - listen to and document DS or MLE needs, problems, and success factors; - act as a key resource for data scientists and MLEs, facilitate product adoption; - provide strategic recommendations for engineering, product, research, and marketing teams to help prioritize company direction and the product roadmap; - participate in product development so you’re in touch with our product, market, and communities; - manage, analyze, and report progress on various evangelism initiatives.
PostDoc & PhD Positions at Hong Kong Polytechnic University
The Hong Kong Polytechnic University We are looking for Postdoctoral Fellows and PhDs to work on data management and big data analytics. There are several positions available at our group, Department of Computing, The Hong Kong Polytechnic University.
*** Postdoc position ***: - 1-year or 2-year Postdoc positions with highly competitive salary starting from 32000HKD per month (negotiable). - First come first serve until the positions are filled. - Applicants should have a PhD degree in Computer Science, or a related field or an equivalent qualification, also have rich research experience and publication record in the fields of big data analytics and data management, e.g., graph algorithms, GPU parallel computing, data processing, multi-dimensional data management. - Applicants will be responsible for developing effective data mining and learning algorithms/systems, and publishing high-quality research papers in top conferences or journals.
*** PhD students ***: - Applicants should have a bachelor/master degree in Computer Science or related areas. - Strong programming skills and algorithmic analysis skills - Has publications or high GPA
*** How to apply ***: Send an email with your CV to email@example.com
PhD or PostDoc position in Data Systems focusing on Computational Storage at IT University of Copenhagen
IT University of Copenhagen PhD or up to 2-year PostDoc position in Data Systems focusing on Modern Storage Hierarchies, Computational Storage, and Locality-Aware Scheduling for Data-Intensive Workloads at DASYA lab (https://dasya.itu.dk/) in IT University of Copenhagen (itu.dk). Supervised by Pınar Tözün and Philippe Bonnet. Ideal starting date is January 2022 or soon thereafter. If you want to learn more about the position, contact Pınar Tözün (firstname.lastname@example.org).
Software Developer at Exadata
Oracle Oracle Exadata is the best cloud database platform on the market today. Combining the latest advancements in hardware with ground-breaking software innovations, Exadata is the #1 platform to run both OLTP and analytics database workloads everywhere in cloud and on premises. We harness the power of Persistent Memory and Remote Direct Memory Access (RDMA) technology, delivering the fastest database OLTP data and commit IO accelerator with sub-19 usec OLTP IO latency and 16 million IOPS in a full rack. We pioneer the innovation to offload analytics query into storage along with columnar rewrites and vector processing. Exadata is a truly disruptive platform that delivers unsurpassed database performance. Customers have responded, and the platform has seen massive organic growth since its introduction. 87% of Global Fortune 100 run their most mission-critical systems on Exadata. This growth has led Larry Ellison, Chairman of the Board of Oracle, to declare Exadata “the most successful product in Oracle’s history”. In the cloud, popular Oracle cloud services like Autonomous Database and Exadata Cloud Service, all build on Exadata to deliver the extreme performance and maximum availability that our enterprise customers demand.
Join an amazing team that created the first Database Machine in the industry to provide extreme performance for database workloads. You’ll work on adopting, researching and harnessing the power of the latest storage and networking technology, such as Persistent Memory, RoCE and NVMe Flash.
You’ll also work on designing and creating a new highly available, cloud scaling, and extremely fast distributed database storage system that continuously sets new world records with every Exadata release. A small sampling of past projects includes building a boundless scalable distributed storage system, creating a persistent memory cache that greatly accelerates OLTP IOs, and developing an allocate-on-write snapshot infrastructure in database storage.
Database Software Engineer
Apple Welcome VLDB Attendees!
At Apple, collaboration and innovation go hand in hand with scale and impact to develop first-in-class database solutions which are the foundation of Apple's infrastructure and platform technologies. Join our team and contribute to our growing mission that is powering solutions for millions.
We are searching for a capable engineers who have interest in database and scaleable systems development. In this highly visible position, you will collaborate with multi-functional engineering teams to tackle meaningful technical problems while collaborating with innovative product development teams to define and implement core platform frameworks and technologies that power next generation Apple products. We promote innovation and new technology to further improve our creative output, and we are looking for creative and passionate people to help us bring our visions to fruition.
Candidates with a deep academic understanding of machine learning and artificial intelligence who are interested in contributing to fundamental research by working to define, design, implement, and evaluate algorithms will also be considered.
Desired Education & Experience BS, MS, and/or PhD in Computer Science and relevant experience is desired.
Oracle Software Internship Positions Open for Summer 2022 -
Oracle Exadata is the best cloud database platform on the market today. Combining the latest advancements in hardware with ground-breaking software innovations, Exadata is the #1 platform to run both OLTP and analytics database workloads everywhere in cloud and on premises. We harness the power of Persistent Memory and RDMA over Converged Ethernet (RoCE) technology, delivering the fastest database OLTP data and commit IO accelerator with sub-19 usec OLTP IO latency and 16 million IOPS in a full rack. We pioneer the innovation to offload analytics query into storage along with columnar rewrites and vector processing. Exadata is a truly disruptive platform that delivers unsurpassed database performance. Customers have responded, and the platform has seen massive organic growth since its introduction. 87% of Global Fortune 100 run their most mission-critical systems on Exadata. This growth has led Larry Ellison, Chairman of the Board of Oracle, to declare Exadata “the most successful product in Oracle’s history”.
Join an amazing team that created the first Database Machine in the industry to provide extreme performance for database workloads. You’ll work on adopting, researching and harnessing the power of the latest storage and networking technology, such as Persistent Memory, RoCE and NVMe Flash. You’ll also work on designing and creating a new highly available, cloud scaling, and extremely fast distributed database storage system that continuously sets new world records with every Exadata release. A small sampling of past projects includes building a boundless scalable distributed storage system, creating a persistent memory cache that greatly accelerates OLTP IOs, and developing an allocate-on-write snapshot infrastructure in database storage.
Databricks Our mission at Databricks is to radically simplify the whole data lifecycle from ingestion to ETL, BI, and all the way up to ML/AI with a unified platform. To achieve this goal, we believe the data warehouse architecture as we know it today will be replaced by a new architectural pattern, Lakehouse (CIDR 2021 paper: http://cidrdb.org/cidr2021/papers/cidr2021_paper17.pdf), open platforms that unify data warehousing and advanced analytics. The new architecture will help address several major challenges, including data staleness, reliability, total cost of ownership, data lock-in, and limited use-case support.
A critical part of realizing this vision is the next generation (decoupled) query engine and structured storage system that can outperform specialized data warehouses in relational query performance, yet retain the expressiveness and of general purpose systems such as Spark to support diverse workloads ranging from ETL to data science.
As part of this team, you will be working in one or more of the following areas to design and implement these next gen systems that leapfrog state-of-the-art:
- Query compilation and optimization - Distributed query execution and scheduling - Vectorized execution engine - Data security - Resource management - Transaction coordination - Efficient storage structures (encodings, indexes) - Automatic physical data optimization
What we look for: - A passion for database systems, storage systems, distributed systems, language design, or performance optimization - Experience working towards a multi-year vision with incremental deliverables - Motivated by delivering customer value and impact - 3+ years of experience working in a related system (preferred) - Optional: PhD in databases or distributed systems