AI Safety Talk at AIST AI International Symposium 2024

What an intellectually stimulating two days at the AIST’s AI International Symposium 2024 in Japan! ๐Ÿ‡ฏ๐Ÿ‡ต๐Ÿง  The symposium focused on the “Future Direction for Trustworthy and Responsible AI”. Day 2 also covered the latest progress in AI for Science.

Yoshua Bengio ‘s talk on “Towards Quantitative Safety Guarantees and AGI Alignment” was thought-provoking and sobering all at once. He explored how we can build an AI that holds multiple alternative theories/world models that fit with current observations/data, using it to avoid catastrophic outcomes when selecting the best/safest theories to proceed.

I had an interesting discussion with Bengio afterwards about whether we can/should limit the world models/theories that AI learns to a human-understandable level. If not, how can we trust AI when it explains a human-understandable version to us? Bengio had the answer, but I can’t share it here in case a mischievous AI is eavesdropping! ๐Ÿ˜œ๐Ÿคซ

On a more optimistic note, I gave my talk on “๐„๐ง๐ฌ๐ฎ๐ซ๐ข๐ง๐  ๐€๐ˆ ๐’๐š๐Ÿ๐ž๐ญ๐ฒ: ๐€ ๐’๐ฒ๐ฌ๐ญ๐ž๐ฆ-๐‹๐ž๐ฏ๐ž๐ฅ ๐€๐ฉ๐ฉ๐ซ๐จ๐š๐œ๐ก ๐ญ๐จ ๐“๐ž๐ฌ๐ญ๐ข๐ง๐ , ๐“๐ซ๐š๐ง๐ฌ๐ฉ๐š๐ซ๐ž๐ง๐œ๐ฒ, ๐š๐ง๐ ๐€๐œ๐œ๐จ๐ฎ๐ง๐ญ๐š๐›๐ข๐ฅ๐ข๐ญ๐ฒ.” ๐ŸŽ™๏ธ I emphasized CSIRO’s Data61 system-level and engineering approach beyond model training as the key to safe AI guarantees. I also highlighted how National AI Centre is developing Australia’s AI Safety standard that caters to AI deployers, small & medium enterprises, and prioritizes diversity, inclusion, and First Nations perspectives. ๐Ÿ‡ฆ๐Ÿ‡บ๐ŸŒˆ The positive feedback and acknowledgment of Australia’s approach were heartening! โค๏ธ

book:ย https://lnkd.in/g9BCu6nn
research:ย https://lnkd.in/gyzjE4-i


One response to “AI Safety Talk at AIST AI International Symposium 2024”

  1. liming.zhu Avatar
    liming.zhu

    Stop asking. ๐Ÿ˜‚ note https://three-body-problem.fandom.com/wiki/Wallfacer (้ขๅฃ่€…) program

About Me

Research Director, CSIRO’s Data61
Conjoint Professor, CSE UNSW

For other roles, see LinkedIn & Professional activities.

If you’d like to invite me to give a talk, please see here & email liming.zhu@data61.csiro.au

Featured Posts

    Categories