Research Scientist , Revamping the Role of Modern Flash Storage in Accelerating Generative AI Pipelines für myScience in Zurich - myjob.ch
Die besten 33063 Jobs in der Region.
Job-typ
10-100%
Pensum
Position

      27.04.2025

      Research Scientist , Revamping the Role of Modern Flash Storage in Accelerating Generative AI Pipelines

      • Zurich
      • Festanstellung 100%

      • Merken
      • drucken
       

      myScience

      myScience

      Research Scientist , Revamping the Role of Modern Flash Storage in Accelerating Generative AI Pipelines

      Research Scientist , Revamping the Role of Modern Flash Storage in Accelerating Generative AI Pipelines

      IBM Zurich Research Laboratory
      Workplace Zurich, Zurich region, Switzerland
      Category
      Computer Science
      Position
      Senior Scientist / Postdoc
      Published 21 November 2024
      Research Scientist

      Revamping the Role of Modern Flash Storage in Accelerating Generative AI Pipelines

      Ref. 2024_031

      Position description

      Generative AI (GenAI) pipelines, especially using the RAG pattern, have attracted a significant amount of community attention due to their widespread use, effectiveness, and potential applicability in a variety of domains. In an ideal case, all data/metadata used in these workflows will be contained in memory. However, DRAM technology faces challenges from multiple fronts as it is not scaling, has a high cost ($/GB), and is energy inefficient. As the performance of Flash-based NVMe devices keeps increasing, the key research question that we are interested in is: how can modern Flash memory and NVMe hardware/software stacks help run data-intensive GenAI pipelines (focus on RAG, i.e. information retrieval and LLM inference)?

      In this context, we aim to answer the following research questions:

      1. What dependencies do RAG pipelines have on storage and how is storage accessed in RAG pipelines?
      2. How can Vector DBs benefit from using high-performance NVMe flash arrays that can support millions of small I/O operations per seconds?
      3. How can flash memory help accelerate the inference of LLMs?



      Necessary qualifications

      • The candidate has obtained a Ph.D. degree in electrical engineering or computer science with a focus on (graph) data mining, (graph) databases, data storage, or (generative) AI in the past 5 years.
      • Excellent coding skills, in Python and C++, familiarity with Linux environments and software development tools (git/GitHub, IDEs, gcc, virtual machine and containers, etc.).
      • High level of creativity and outstanding problem-solving ability.
      • Excellent oral and written English and excellent presentation skills.
      • Strong academic record and experience with publishing and presenting at international conferences.
      • Strong interpersonal skills and excellent written and verbal communication.


      Preferred Qualifications

      • Background and experience in systems architecture and hardware design.
      • Experience with systems programming and internals (kernel, memory management, hardware, CPU)
      • Experience with data storage and NVMe storage internals and specification.



      Diversity

      IBM is committed to diversity at the workplace. With us you will find an open, multicultural environment. Excellent flexible working arrangements enable all genders to strike the desired balance between their professional development and their personal lives.

      How to apply

      If you are interested in this position, please submit your application here.
      In your application, please refer to myScience.ch and referenceJobID65889.

      Arbeitsort: Zurich