ASC24 Finals Page

Rules of the final stage

1. The use of optimization methods specific to certain parameters or input data sets is strictly forbidden.

2. If any changes are made to the algorithm, the revised version must maintain mathematical equivalence to the original.

3. Violation of any rule mentioned above will result in a zero score being assigned for the corresponding task.

4. Properly constructing the cluster is essential. Any damage to the server may incur a penalty of up to 20 points for the team, as determined by the ASC24 Committee.

Performance Optimization

I. HPL performance optimization:

1. Platform requirement: The runtime power consumption must remain under 3 kW. Failure to comply will result in the invalidation of the current task.

2. Goal: Obtain the correct results while achieving the highest performance.

3. Software download: http://www.netlib.org/benchmark/hpl/
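The 3 kW limit in the platform requirement above can be checked against logged power readings. The following is a minimal sketch, assuming the readings are available as a list of values in watts sampled during the run. The function name `first_violation` and the sample data are hypothetical, and the committee's own metering method is not described here. "Under 3 kW" is read as strictly below the limit, which is also an assumption.

```python
POWER_LIMIT_WATTS = 3000.0  # 3 kW limit stated in the rules


def first_violation(samples_watts, limit=POWER_LIMIT_WATTS):
    """Return the index of the first sample at or above the limit, or None.

    Assumption: "under 3 kW" means strictly below, so a reading equal to
    the limit is treated as a violation.
    """
    for index, watts in enumerate(samples_watts):
        if watts >= limit:
            return index
    return None


if __name__ == "__main__":
    # Hypothetical readings in watts, for illustration only.
    readings = [2750.0, 2910.5, 3004.2, 2880.0]
    index = first_violation(readings)
    if index is None:
        print("All samples are under the limit.")
    else:
        print(f"Limit reached at sample {index}: {readings[index]} W")
```

The same check applies to every task below, since each one carries the same 3 kW restriction.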

II. Performance optimization of HPCG:

1. Platform requirement: The runtime power consumption must remain under 3 kW. Failure to comply will result in the invalidation of the current task.

2. Run time: The run time of HPCG (version 3.0), as reported in the output file, must be at least 1800 seconds (30 minutes). The Quick Path option is not permitted.

3. Software download: https://github.com/hpcg-benchmark/hpcg
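The run-time requirement above is usually set through the benchmark's input file. The sketch below generates the text of an `hpcg.dat` file, assuming the reference layout of two comment lines, a line with the local grid dimensions, and a line with the requested run time in seconds. The helper name `make_hpcg_dat` and the grid size 104 are hypothetical choices for illustration. The rule is judged on the time reported in the output file, so the requested value here is only an input.

```python
MIN_RUNTIME_SECONDS = 1800  # minimum run time stated in the rules


def make_hpcg_dat(nx, ny, nz, runtime_seconds):
    """Build the text of an hpcg.dat file (assumed reference layout).

    Raises ValueError if the requested run time is below the minimum
    required by the rules.
    """
    if runtime_seconds < MIN_RUNTIME_SECONDS:
        raise ValueError(
            f"requested run time {runtime_seconds} s is below "
            f"{MIN_RUNTIME_SECONDS} s"
        )
    return (
        "HPCG benchmark input file\n"
        "Sandia National Laboratories; University of Tennessee, Knoxville\n"
        f"{nx} {ny} {nz}\n"
        f"{runtime_seconds}\n"
    )


if __name__ == "__main__":
    # Hypothetical local grid of 104^3 with a small margin over 1800 s.
    print(make_hpcg_dat(104, 104, 104, 1860), end="")
```

Requesting slightly more than 1800 seconds is a design choice in this sketch, on the assumption that the reported time may differ a little from the requested time.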

III. Performance optimization of OpenCAEPoro:

1. Platform requirement: The power restriction of the test platform is 3 kW. If the power consumption exceeds 3 kW, the results of the current task will be invalid.

2. Goal: The OpenCAEPoro challenge shares a similar objective with the preliminary round. The ASC24 committee will announce several OpenCAEPoro workloads during the finals. Each workload's results must pass correctness checking, and the objective is to minimize OBJECT TIME. Modifying any code related to the method parameters is prohibited, and all parameters in the input files must remain unchanged.

3. Software download: https://github.com/OpenCAEPlus/OpenCAEPoro_ASC2024

IV. Performance optimization of GoMars:

1. Platform requirement: The power restriction of the test platform is 3 kW. If the power consumption exceeds 3 kW, the results of the current task will be invalid.

2. Goal: GoMars is a novel global open planetary atmospheric model designed for Mars. It offers insights into Martian meteorological conditions crucial for landing operations. The ASC24 committee will introduce various GoMars workloads during the finals. The results of each workload must pass correctness checking, and the objective is to minimize the runtime of the GoMars application. Please note that the provided link contains only the dynamic core of GoMars; additional code for physical processes may be included in the final version.

3. Software download: https://gitee.com/dongli85/GMCORE

V. Performance optimization of the Mystery Application:

1. Platform requirement: The power restriction of the test platform is 3 kW. If the power consumption exceeds 3 kW, the results of the current task will be invalid.

2. Goal: The ASC24 committee will announce several Mystery Application workloads during the finals. The results of each workload must pass correctness checking, and the objective is to minimize runtime.

VI. Performance optimization of LLM inference:

1. Platform requirement: The power limit for the test platform is 3 kW. Any power consumption exceeding this threshold during the contest will result in the invalidation of the current task's results.

2. Goal: The LLM inference challenge shares a similar goal with the preliminary round. AquilaChat2-34B will be used in the final round, and the datasets used for evaluation will be disclosed on site. In addition, inference acceleration during the final must use quantization of 4 bits or fewer; the baseline quantization method is 4-bit bitsandbytes. Moreover, the overall accuracy of the dataset results, rather than only the output length, will be taken into account, and it must stay within a maximum error of 1.5% compared to the BF16 precision baseline.

3. Download: Model weights and the baseline code for bitsandbytes: https://huggingface.co/BAAI/AquilaChat2-34B/
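The 1.5% accuracy tolerance in the goal above can be expressed as a small check. The rules do not say whether the error is an absolute difference in accuracy or a difference relative to the BF16 baseline, so the sketch below supports both readings and treats the choice as an assumption. The function names and the accuracy values are hypothetical, and the official scoring procedure may differ.

```python
MAX_ERROR = 0.015  # 1.5% maximum error stated in the rules


def accuracy_error(baseline_acc, quantized_acc, relative=False):
    """Return the accuracy error of the quantized model against the baseline.

    Accuracies are fractions in [0, 1]. With relative=False the error is the
    absolute difference; with relative=True it is divided by the baseline.
    Which reading the committee uses is an assumption left to the caller.
    """
    diff = abs(baseline_acc - quantized_acc)
    if not relative:
        return diff
    if baseline_acc == 0:
        raise ValueError("relative error is undefined for a zero baseline")
    return diff / baseline_acc


def within_tolerance(baseline_acc, quantized_acc, relative=False,
                     max_error=MAX_ERROR):
    """Return True if the accuracy error does not exceed max_error."""
    return accuracy_error(baseline_acc, quantized_acc, relative) <= max_error


if __name__ == "__main__":
    # Hypothetical accuracies: BF16 baseline and a 4-bit quantized run.
    bf16_acc, int4_acc = 0.712, 0.701
    print("absolute reading:", within_tolerance(bf16_acc, int4_acc))
    print("relative reading:", within_tolerance(bf16_acc, int4_acc, relative=True))
```

With these hypothetical values the two readings disagree, which is why it is worth confirming the intended definition with the committee before relying on either one.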

Team Presentation

1. Each team is required to present their results using PowerPoint (PPT) slides. The presentation order will be determined by a draw. English must be used in both the PPT slides and the presentations, which may be delivered by up to two student speakers.

2. The presentation must not exceed 7 minutes. Exceeding the time limit will reduce the team's score accordingly. Following the presentation, judges will have approximately 3 minutes to ask questions.

3. The evaluation committee will assess the presentation of each team.

4. The team advisor is welcome to observe her/his team's presentation session.

2025-09-19 00:01:47