publications

publications by categories in reversed chronological order. generated by jekyll-scholar.

2024

  1. Mechanistic Interpretability for AI Safety — A Review
    Leonard F. Bereska, and Efstratios Gavves
    TMLR, Apr 2024

2023

  1. Taming Simulators: Challenges, Pathways and Vision for the Alignment of Large Language Models
    Leonard F. Bereska, and Efstratios Gavves
    AAAI-SS, Oct 2023

2022

  1. Continual Learning of Dynamical Systems With Competitive Federated Reservoir Computing
    Leonard F. Bereska, and Efstratios Gavves
    Proceedings of The 1st Conference on Lifelong Learning Agents, Nov 2022
  2. Tractable Dendritic RNNs for Reconstructing Nonlinear Dynamical Systems
    Manuel Brenner, Florian Hess, Jonas M. Mikhaeil, Leonard F. Bereska , and 3 more authors
    Proceedings of the 39th International Conference on Machine Learning, Jun 2022

2019

  1. Unsupervised Part-Based Disentangling of Object Shape and Appearance
    Dominik Lorenz, Leonard F. Bereska, Timo Milbich, and Bjorn Ommer
    Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Jun 2019