Beyond Language: Applying MLX Transformers to Engineering Physics
Stavros Kassinos, Alessio Alexiadis

TL;DR
This paper introduces a physics-informed Transformer model for solving 2D heat conduction problems, leveraging MLX framework for efficient training and prediction on personal hardware, demonstrating high accuracy on unseen conditions.
Contribution
It presents the first application of Transformers to engineering physics problems using MLX, enabling accessible and efficient physics-based modeling on personal devices.
Findings
High accuracy in predicting steady-state temperature fields
Efficient training on modest hardware using MLX framework
Successful generalization to unseen boundary conditions
Abstract
Transformer Neural Networks are driving an explosion of activity and discovery in the field of Large Language Models (LLMs). In contrast, there have been only a few attempts to apply Transformers in engineering physics. Aiming to offer an easy entry point to physics-centric Transformers, we introduce a physics-informed Transformer model for solving the heat conduction problem in a 2D plate with Dirichlet boundary conditions. The model is implemented in the machine learning framework MLX and leverages the unified memory of Apple M-series processors. The use of MLX means that the models can be trained and perform predictions efficiently on personal machines with only modest memory requirements. To train, validate and test the Transformer model we solve the 2D heat conduction problem using central finite differences. Each finite difference solution in these sets is initialized with four…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems
MethodsAttention Is All You Need · Sparse Evolutionary Training · Dense Connections · Adam · Linear Layer · Residual Connection · Position-Wise Feed-Forward Layer · Label Smoothing · Dropout · Byte Pair Encoding
