This recommended practice provides a data processing framework for training large language models and refines the relevant terms and definitions. It also describes the data processing procedures, methods, characteristics, and performance evaluation for the pre-training and fine-tuning stages.
- Standard Committee
- C/AISC - Artificial Intelligence Standards Committee
- Status
- Active PAR
- PAR Approval
- 2023-09-21
Working Group Details
- Society
- IEEE Computer Society
- Standard Committee
- C/AISC - Artificial Intelligence Standards Committee
- Working Group
- DTLLM - Data for Training Large Language Models
- IEEE Program Manager
- Christy Bahn
- Working Group Chair
- Dan Liu