11.9 C
Canberra
Tuesday, August 19, 2025

Unlocking knowledge synthesis with a conditional generator


Experiments

We performed experiments on 4 datasets, the place three datasets correspond with downstream generative duties and one dataset with a classification activity. Generative duties are sometimes tougher than classification duties. It is because the generative duties are evaluated by the next-token prediction accuracy, which requires the artificial knowledge to protect fine-grained textual info from the non-public knowledge. In distinction, the classification duties solely require sustaining the co-occurrence patterns between labels and phrases within the non-public knowledge.

The three generative duties are chosen to cowl a various set of sensible eventualities: PubMed (medical paper abstracts), Chatbot Enviornment (human-to-machine interactions), and Multi-Session Chat (human-to-human day by day dialogues). To guage the standard of the generated artificial knowledge, we adopted the setup of Aug-PE to coach a small downstream language mannequin on the artificial knowledge after which compute the next-token prediction accuracy on the actual take a look at knowledge.

The classification activity is carried out on the OpenReview (educational paper critiques) dataset. To guage the standard of the generated artificial knowledge, we prepare a downstream classifier on the artificial knowledge, and compute the classification accuracy on the actual take a look at knowledge.

To mitigate issues relating to knowledge contamination, we rigorously analyzed our chosen datasets. Our evaluation confirmed no overlap between our pre-training knowledge and the downstream datasets.

Related Articles

LEAVE A REPLY

Please enter your comment!
Please enter your name here

[td_block_social_counter facebook="tagdiv" twitter="tagdivofficial" youtube="tagdiv" style="style8 td-social-boxed td-social-font-icons" tdc_css="eyJhbGwiOnsibWFyZ2luLWJvdHRvbSI6IjM4IiwiZGlzcGxheSI6IiJ9LCJwb3J0cmFpdCI6eyJtYXJnaW4tYm90dG9tIjoiMzAiLCJkaXNwbGF5IjoiIn0sInBvcnRyYWl0X21heF93aWR0aCI6MTAxOCwicG9ydHJhaXRfbWluX3dpZHRoIjo3Njh9" custom_title="Stay Connected" block_template_id="td_block_template_8" f_header_font_family="712" f_header_font_transform="uppercase" f_header_font_weight="500" f_header_font_size="17" border_color="#dd3333"]
- Advertisement -spot_img

Latest Articles