A MULTI OBJECTIVE DEEP REINFORCEMENT LEARNING METHOD FOR X2026