ASX-Dividend-Report-Banner

One half-day of training using a few hundred dollars yields similar results to mainstream large models, open-source and commercial-free domain-specific LLM solution

October 02, 2023 02:22 AM AEDT | By Cision
Follow us on Google News: https://kalkinemedia.com/resources/assets/public/images/google-news.webp
 One half-day of training using a few hundred dollars yields similar results to mainstream large models, open-source and commercial-free domain-specific LLM solution
Image source: Kalkine Media

SINGAPORE, Oct. 1, 2023 /PRNewswire/ -- Recently, Colossal-AI built a remarkable domain-specific large language model (LLM) by utilizing only a few hundred dollars training costs. It can be readily applied across various domains, facilitating the economical construction of large AI models.

The solution is accessible without any commercial restrictions, with complete transparency extended to the entire training process, code, and model weights.

Technical details, open-source code and weights are available at : https://github.com/hpcaitech/ColossalAI

Bridging from any general large models to any domain-specific large models with only a few hundred dollars.
Bridging from any general large models to any domain-specific large models with only a few hundred dollars.

Performance

Colossal-AI's model not only enhances Chinese language capabilities but also further refines its proficiency in English. Remarkably, it exhibits performance levels that rival state-of-the-art (SOTA) models of similar scale within the open-source community.

In conjunction with this, Colossal-AI offers the comprehensive evaluation framework, ColossalEval, facilitating cost-effective reproducibility.

In addition, fine-tuning through methods like SFT and LoRA has limitations in effectively infusing knowledge and capabilities from the base model. It doesn't satisfactorily meet the requirements for constructing high-quality domain-specific knowledge or specialized model applications.

Bridging from General Large Models to Domain-specific Large Models

More importantly, the creation of a Chinese version not only offers the advantage of reusability but also carries significant importance in real-world implementation scenarios.

It's widely recognized that the cost of pre-training large AI models from scratch is exorbitant, often humorously referred to as a domain accessible only to those with "50 million dollars" to spare.

Many tech giants and AI startups are eager to invest heavily in building large general-purpose models. However, behind the generality of these large models often lies a lack of domain-specific knowledge. Consequently, the issue of practical applicability becomes particularly serious.

If a domain-specific large model can be rapidly and cost-effectively constructed, followed by fine-tuning for specific business needs, it would undoubtedly advance the deployment of applications, providing a competitive advantage.

Applying the above process to perform knowledge transfer in any field allows for the cost-effective construction of lightweight domain-specific foundational large models.

For constructing foundational large models from scratch, one can also draw inspiration from the aforementioned experiences and Colossal-AI's cost-reduction and efficiency-enhancing capabilities to efficiently achieve this goal at minimal cost.

Colossal-AI System Optimization and Cloud Platform

The impressive performance and cost advantages are built upon the foundation of the low-cost AI large model development system, Colossal-AI.

Colossal-AI leverages efficient techniques to reduce the costs of large AI models training, fine-tuning, and inference. It has collaborated with numerous Fortune 500 companies and other well-known enterprises.

To further enhance the efficiency of large model development and deployment, Colossal-AI has been upgraded to the Colossal-AI cloud platform, which is now in public beta, and registration will provide you with vouchers.

Colossal-AI Cloud Platform: platform.colossalai.com

Colossal-AI Open Source Address: https://github.com/hpcaitech/ColossalAI

About HPC-AI Tech

HPC-AI Tech is a startup headquartered in Singapore. Its flagship product, Colossal-AI, is a versatile deep learning system designed for the era of large AI models. It enables efficient and rapid deployment of large AI model training and inference, resulting in significant cost reduction for large AI model applications. HPC-AI Tech raised 22 million USD in Series A Funding in July 2023.

For media inquiries or more information, please contact:

[email protected]


Disclaimer

The content, including but not limited to any articles, news, quotes, information, data, text, reports, ratings, opinions, images, photos, graphics, graphs, charts, animations and video (Content) is a service of Kalkine Media Pty Ltd (Kalkine Media, we or us), ACN 629 651 672 and is available for personal and non-commercial use only. The principal purpose of the Content is to educate and inform. The Content does not contain or imply any recommendation or opinion intended to influence your financial decisions and must not be relied upon by you as such. Some of the Content on this website may be sponsored/non-sponsored, as applicable, but is NOT a solicitation or recommendation to buy, sell or hold the stocks of the company(s) or engage in any investment activity under discussion. Kalkine Media is neither licensed nor qualified to provide investment advice through this platform. Users should make their own enquiries about any investments and Kalkine Media strongly suggests the users to seek advice from a financial adviser, stockbroker or other professional (including taxation and legal advice), as necessary. Kalkine Media hereby disclaims any and all the liabilities to any user for any direct, indirect, implied, punitive, special, incidental or other consequential damages arising from any use of the Content on this website, which is provided without warranties. The views expressed in the Content by the guests, if any, are their own and do not necessarily represent the views or opinions of Kalkine Media. Some of the images/music that may be used on this website are copyright to their respective owner(s). Kalkine Media does not claim ownership of any of the pictures displayed/music used on this website unless stated otherwise. The images/music that may be used on this website are taken from various sources on the internet, including paid subscriptions or are believed to be in public domain. We have used reasonable efforts to accredit the source wherever it was indicated as or found to be necessary.

Recent Articles

Investing Tips

Previous Next
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.