Fast Pareto Set Approximation for Multi-objective Flexible Job Shop 调度 via Parallel Preference-conditioned Graph Reinforcement Learning
We develop a preference-conditioned graph reinforcement learning framework for fast Pareto set approximation in multi-objective flexible job shop scheduling. The method parallelizes preference-based learning and substantially improves both solution quality and computational speed on large-scale scheduling problems.