Publications
2025
- Manifold Metric: A Loss Landscape Approach for Predicting Model Performance. In Proceedings of The 4th Conference on Lifelong Learning Agents, 2025
- NeoBERT: A Next Generation BERT. Transactions on Machine Learning Research, 2025. Reproducibility Certification
- Structure-Aligned Protein Language Model. arXiv preprint arXiv:2505.16896, 2025
- Small Encoders Can Rival Large Decoders in Detecting Groundedness. Jul 2025
- Combining Domain and Alignment Vectors Provides Better Knowledge-Safety Trade-offs in LLMs. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Jul 2025
- CADmium: Fine-Tuning Code Language Models for Text-Driven Sequential CAD Design. arXiv preprint arXiv:2507.09792, Jul 2025
- NovoMolGen: Rethinking Molecular Language Model Pretraining. arXiv preprint arXiv:2508.13408, Jul 2025
2024
- A Deep Dive into the Trade-Offs of Parameter-Efficient Preference Alignment Techniques. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Aug 2024
- Exploring Quantization for Efficient Pre-Training of Transformer Language Models. In Findings of the Association for Computational Linguistics: EMNLP 2024, Nov 2024
- Protein language models: Is scaling necessary? bioRxiv, Nov 2024
- Interpolate: How Resetting Active Neurons can also improve Generalizability in Online Learning. Nov 2024
2023
- A practical survey on faster and lighter transformers. ACM Computing Surveys, Nov 2023
- Detection of microservice-based software anomalies based on OpenTracing in cloud. Software: Practice and Experience, Nov 2023
- Distributed computation of the critical path from execution traces. Software: Practice and Experience, Nov 2023
- Language models for novelty detection in system call traces. arXiv preprint arXiv:2309.02206, Nov 2023
- Energy and carbon-aware initial VM placement in geographically distributed cloud data centers. Sustainable Computing: Informatics and Systems, Nov 2023
2022
- Machine Learning for Anomaly Detection in Kernel Traces. Nov 2022
2021
- On improving deep learning trace analysis with system call arguments. In 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR), Nov 2021
- Automated cause analysis of latency outliers using system-level dependency graphs. In 2021 IEEE 21st International Conference on Software Quality, Reliability and Security (QRS), Nov 2021
2020
- Depgraph: Localizing performance bottlenecks in multi-core applications using waiting dependency graphs and software tracing. In 2020 IEEE 20th International Working Conference on Source Code Analysis and Manipulation (SCAM), Nov 2020
2019
- Empirical comparison between autoencoders and traditional dimensionality reduction methods. In 2019 IEEE Second International Conference on Artificial Intelligence and Knowledge Engineering (AIKE), Nov 2019
- Automatic cause detection of performance problems in web applications. In 2019 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW), Nov 2019