,

AI Myths

I often share missed or surprising observations in my talks. Here are five of them, summarised as food for thought.

๐— ๐˜†๐˜๐—ต ๐Ÿญ: ๐—ข๐˜ƒ๐—ฒ๐—ฟ๐˜€๐—ฒ๐—ฎ๐˜€-๐˜๐—ฟ๐—ฎ๐—ถ๐—ป๐—ฒ๐—ฑ ๐—”๐—œ ๐˜„๐—ผ๐—ปโ€™๐˜ ๐—ฎ๐—น๐—ถ๐—ด๐—ป ๐˜„๐—ถ๐˜๐—ต ๐—”๐˜‚๐˜€๐˜๐—ฟ๐—ฎ๐—น๐—ถ๐—ฎ๐—ป ๐˜ƒ๐—ฎ๐—น๐˜‚๐—ฒ๐˜€.
Itโ€™s intuitive to think globally trained models canโ€™t reflect our norms. Yet when frontier models answer the same cultural questions posed to national cohorts, they align most strongly with Australia and New Zealandโ€”above the US [1]. There are many possible reasons, but ๐˜ต๐˜ฉ๐˜ฆ ๐˜ช๐˜ฏ๐˜ด๐˜ช๐˜จ๐˜ฉ๐˜ต ๐˜ช๐˜ด ๐˜ค๐˜ญ๐˜ฆ๐˜ข๐˜ณ: ๐˜ธ๐˜ฆ ๐˜ฏ๐˜ฆ๐˜ฆ๐˜ฅ ๐˜ณ๐˜ช๐˜จ๐˜ฐ๐˜ณ๐˜ฐ๐˜ถ๐˜ด ๐˜ฆ๐˜ท๐˜ข๐˜ญ๐˜ถ๐˜ข๐˜ต๐˜ช๐˜ฐ๐˜ฏ, ๐˜ฏ๐˜ฐ๐˜ต ๐˜ซ๐˜ถ๐˜ด๐˜ต ๐˜ช๐˜ฏ๐˜ต๐˜ถ๐˜ช๐˜ต๐˜ช๐˜ฐ๐˜ฏ.

๐— ๐˜†๐˜๐—ต ๐Ÿฎ: ๐—ง๐—ผ ๐˜€๐—ผ๐—น๐˜ƒ๐—ฒ ๐—ฎ ๐—ฝ๐—ฟ๐—ผ๐—ฏ๐—น๐—ฒ๐—บ, ๐˜†๐—ผ๐˜‚ ๐—บ๐˜‚๐˜€๐˜ ๐˜๐—ฟ๐—ฎ๐—ถ๐—ป ๐—ผ๐—ป ๐—ฑ๐—ฎ๐˜๐—ฎ ๐˜๐—ต๐—ฎ๐˜ ๐—ฟ๐—ฒ๐—ฝ๐—ฟ๐—ฒ๐˜€๐—ฒ๐—ป๐˜๐˜€ ๐—ถ๐˜.
Weโ€™re told models need domain-matched data in training. Yet the strongest systems for essays/poetry improve when trained with software code, data unrelated to the task domain. Some data also plays an outsized role only during inference and has little effect during training. ๐˜›๐˜ฉ๐˜ฆ ๐˜ญ๐˜ฆ๐˜ด๐˜ด๐˜ฐ๐˜ฏ ๐˜ช๐˜ด ๐˜ค๐˜ฐ๐˜ฏ๐˜ด๐˜ช๐˜ด๐˜ต๐˜ฆ๐˜ฏ๐˜ต: ๐˜ฆ๐˜ท๐˜ข๐˜ญ๐˜ถ๐˜ข๐˜ต๐˜ฆ ๐˜ธ๐˜ฉ๐˜ฆ๐˜ณ๐˜ฆ ๐˜ถ๐˜ฏ๐˜ช๐˜ฒ๐˜ถ๐˜ฆ ๐˜ฅ๐˜ข๐˜ต๐˜ข ๐˜ฉ๐˜ข๐˜ด ๐˜ณ๐˜ฆ๐˜ข๐˜ญ ๐˜ช๐˜ฎ๐˜ฑ๐˜ข๐˜ค๐˜ต, ๐˜ฏ๐˜ฐ๐˜ต ๐˜ซ๐˜ถ๐˜ด๐˜ต ๐˜ธ๐˜ฉ๐˜ฆ๐˜ณ๐˜ฆ ๐˜ช๐˜ต ๐˜ญ๐˜ฐ๐˜ฐ๐˜ฌ๐˜ด ๐˜ณ๐˜ฆ๐˜ญ๐˜ฆ๐˜ท๐˜ข๐˜ฏ๐˜ต.

๐— ๐˜†๐˜๐—ต ๐Ÿฏ: ๐—”๐—œ ๐—ถ๐˜€ ๐—ฎ ๐—ด๐—ฒ๐—ป๐—ฒ๐—ฟ๐—ฎ๐—น-๐—ฝ๐˜‚๐—ฟ๐—ฝ๐—ผ๐˜€๐—ฒ ๐˜๐—ฒ๐—ฐ๐—ต๐—ป๐—ผ๐—น๐—ผ๐—ด๐˜† ๐—น๐—ถ๐—ธ๐—ฒ ๐—ฒ๐—น๐—ฒ๐—ฐ๐˜๐—ฟ๐—ถ๐—ฐ๐—ถ๐˜๐˜† ๐—ผ๐—ฟ ๐—ฐ๐—ผ๐—บ๐—ฝ๐˜‚๐˜๐—ถ๐—ป๐—ด.
Electricity and computing required further human invention to build each application. Todayโ€™s AI began as โ€œpredict the next wordโ€, yet a bundle of narrow capabilities automatically emerges without human design. In practice, it behaves less like a general platform and ๐˜ฎ๐˜ฐ๐˜ณ๐˜ฆ ๐˜ญ๐˜ช๐˜ฌ๐˜ฆ ๐˜ข ๐˜ญ๐˜ช๐˜ฃ๐˜ณ๐˜ข๐˜ณ๐˜บ ๐˜ฐ๐˜ง ๐˜ณ๐˜ฆ๐˜ข๐˜ฅ๐˜บ-๐˜ฎ๐˜ข๐˜ฅ๐˜ฆ ๐˜ง๐˜ถ๐˜ฏ๐˜ค๐˜ต๐˜ช๐˜ฐ๐˜ฏ๐˜ด ๐˜ต๐˜ฉ๐˜ข๐˜ต ๐˜ด๐˜ต๐˜ช๐˜ญ๐˜ญ ๐˜ณ๐˜ฆ๐˜ฒ๐˜ถ๐˜ช๐˜ณ๐˜ฆ ๐˜ด๐˜ฎ๐˜ข๐˜ณ๐˜ต ๐˜ฆ๐˜ญ๐˜ช๐˜ค๐˜ช๐˜ต๐˜ข๐˜ต๐˜ช๐˜ฐ๐˜ฏ, ๐˜ณ๐˜ช๐˜จ๐˜ฐ๐˜ณ๐˜ฐ๐˜ถ๐˜ด ๐˜ฆ๐˜ท๐˜ข๐˜ญ๐˜ถ๐˜ข๐˜ต๐˜ช๐˜ฐ๐˜ฏ, ๐˜ข๐˜ฏ๐˜ฅ ๐˜ด๐˜ข๐˜ง๐˜ฆ๐˜ต๐˜บ ๐˜ค๐˜ฐ๐˜ฏ๐˜ต๐˜ณ๐˜ฐ๐˜ญ ๐˜ฐ๐˜ถ๐˜ต๐˜ด๐˜ช๐˜ฅ๐˜ฆ ๐˜ต๐˜ฉ๐˜ฆ ๐˜ฎ๐˜ฐ๐˜ฅ๐˜ฆ๐˜ญ.

๐— ๐˜†๐˜๐—ต ๐Ÿฐ: ๐—ฆ๐—ฎ๐—ณ๐—ฒ๐˜๐˜† ๐—ฎ๐—ป๐—ฑ ๐—ถ๐—ป๐—ป๐—ผ๐˜ƒ๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—ฎ๐—ฟ๐—ฒ ๐—ฎ ๐˜๐—ฟ๐—ฎ๐—ฑ๐—ฒ-๐—ผ๐—ณ๐—ณ.
The assumption that you must choose between safety and performance is outdated. Training for robustness against prompt attacks, for example, tends to improve reasoning and generalisation on challenging benchmarks, turning guardrails into a performance gain. And roughly 70% of safety improvements also arrive alongside general capability gains. ๐˜›๐˜ฉ๐˜ฆ ๐˜ต๐˜ธ๐˜ฐ ๐˜ณ๐˜ฆ๐˜ช๐˜ฏ๐˜ง๐˜ฐ๐˜ณ๐˜ค๐˜ฆ ๐˜ฆ๐˜ข๐˜ค๐˜ฉ ๐˜ฐ๐˜ต๐˜ฉ๐˜ฆ๐˜ณ ๐˜ณ๐˜ข๐˜ต๐˜ฉ๐˜ฆ๐˜ณ ๐˜ต๐˜ฉ๐˜ข๐˜ฏ ๐˜ค๐˜ฐ๐˜ฎ๐˜ฑ๐˜ฆ๐˜ต๐˜ฆ.

๐— ๐˜†๐˜๐—ต ๐Ÿฑ: ๐—›๐˜‚๐—บ๐—ฎ๐—ป ๐—ผ๐˜ƒ๐—ฒ๐—ฟ๐˜€๐—ถ๐—ด๐—ต๐˜ ๐—ป๐—ฎ๐˜๐˜‚๐—ฟ๐—ฎ๐—น๐—น๐˜† ๐—ถ๐—บ๐—ฝ๐—ฟ๐—ผ๐˜ƒ๐—ฒ๐˜€ ๐—ผ๐˜‚๐˜๐—ฐ๐—ผ๐—บ๐—ฒ๐˜€.
Adding humans into the loop sounds safe and performance-enhancing, yet naรฏve oversight often underperforms both AI and human alone. Without the right tools, context, and incentives, reviewers can either over/under-trust or disengage. ๐˜–๐˜ท๐˜ฆ๐˜ณ๐˜ด๐˜ช๐˜จ๐˜ฉ๐˜ต ๐˜ฐ๐˜ฏ๐˜ญ๐˜บ ๐˜ธ๐˜ฐ๐˜ณ๐˜ฌ๐˜ด ๐˜ธ๐˜ฉ๐˜ฆ๐˜ฏ ๐˜ช๐˜ตโ€™๐˜ด ๐˜ฑ๐˜ถ๐˜ณ๐˜ฑ๐˜ฐ๐˜ด๐˜ฆ-๐˜ฅ๐˜ฆ๐˜ด๐˜ช๐˜จ๐˜ฏ๐˜ฆ๐˜ฅ ๐˜ง๐˜ฐ๐˜ณ ๐˜ถ๐˜ฏ๐˜ฅ๐˜ฆ๐˜ณ๐˜ด๐˜ต๐˜ข๐˜ฏ๐˜ฅ๐˜ช๐˜ฏ๐˜จ ๐˜ข๐˜ฏ๐˜ฅ ๐˜ญ๐˜ฆ๐˜ข๐˜ณ๐˜ฏ๐˜ช๐˜ฏ๐˜จ.

At CSIRO’s Data61, weโ€™re operationalising these insights. Talk to us if you are interested.

[1] https://www.adalovelaceinstitute.org/blog/cultural-misalignment-llms/ See Figure 1: the higher a country appears on the vertical axis, the more closely the AI answers align with the answers given by people from that country


About Me


About me – According to AI

Director/Head of CSIRO’s Data61
Conjoint Professor, CSE UNSW

For other roles, see LinkedIn & Professional activities.

If you’d like to invite me to give a talk, please see here & email liming.zhu@data61.csiro.au

Featured Posts