Kuaishou Unveils Kling: A Text-to-Video Model To Challenge OpenAI’s Sora — from maginative.com by Chris McKay


Generating audio for video — from deepmind.google


LinkedIn leans on AI to do the work of job hunting — from techcrunch.com by Ingrid Lunden

Learning personalisation. LinkedIn continues to be bullish on its video-based learning platform, and it appears to have found a strong current among users who need to skill up in AI. Cohen said that traffic for AI-related courses — which include modules on technical skills as well as non-technical ones such as basic introductions to generative AI — has increased by 160% over last year.

You can be sure that LinkedIn is pushing its search algorithms to tap into the interest, but it’s also boosting its content with AI in another way.

For Premium subscribers, it is piloting what it describes as “expert advice, powered by AI.” Tapping into expertise from well-known instructors such as Alicia Reece, Anil Gupta, Dr. Gemma Leigh Roberts and Lisa Gates, LinkedIn says its AI-powered coaches will deliver responses personalized to users, as a “starting point.”

These will, in turn, also appear as personalized coaches that a user can tap while watching a LinkedIn Learning course.

Also related to this, see:

Unlocking New Possibilities for the Future of Work with AI — from news.linkedin.com

Personalized learning for everyone: Whether you’re looking to change or not, the skills required in the workplace are expected to change by 68% by 2030. 

Expert advice, powered by AI: We’re beginning to pilot the ability to get personalized practical advice instantly from industry leading business leaders and coaches on LinkedIn Learning, all powered by AI. The responses you’ll receive are trained by experts and represent a blend of insights that are personalized to each learner’s unique needs. While human professional coaches remain invaluable, these tools provide a great starting point.

Personalized coaching, powered by AI, when watching a LinkedIn course: As learners — including all Premium subscribers — watch our new courses, they can now simply ask for summaries of content, clarify certain topics, or get examples and other real-time insights, e.g. “Can you simplify this concept?” or “How does this apply to me?”

 


Roblox’s Road to 4D Generative AI — from corp.roblox.com by Morgan McGuire, Chief Scientist

  • Roblox is building toward 4D generative AI, going beyond single 3D objects to dynamic interactions.
  • Solving the challenge of 4D will require multimodal understanding across appearance, shape, physics, and scripts.
  • Early tools that are foundational for our 4D system are already accelerating creation on the platform.

 

NYC High School Reimagines Career & Technical Education for the 21st Century — from the74million.org by Andrew Bauld
Thomas A. Edison High School is providing students with the skills to succeed in both college and career in an unusually creative way.

From DSC:
Very interesting to see the mention of an R&D department here! Very cool.

Baker said ninth graders in the R&D department designed the essential skills rubric for their grade so that regardless of what content classes students take, they all get the same immersion into critical career skills. Student voice is now so integrated into Edison’s core that teachers work with student designers to plan their units. And he said teachers are becoming comfortable with the language of career-centered learning and essential skills while students appreciate the engagement and develop a new level of confidence.

The R&D department has grown to include teachers from every department working with students to figure out how to integrate essential skills into core academic classes. In this way, they’re applying one of the XQ Institute’s crucial Design Principles for innovative high schools: Youth Voice and Choice.

Learners need: More voice. More choice. More control. -- this image was created by Daniel Christian


Student Enterprise: Invite Learners to Launch a Media Agency or Publication — from gettingsmart.com by Tom Vander Ark

Key Points

  • Client-connected projects have become a focal point of the Real World Learning initiative, offering students opportunities to solve real-world problems in collaboration with industry professionals.
  • Organizations like CAPS, NFTE, and Journalistic Learning facilitate community connections and professional learning opportunities, making it easier to implement client projects and entrepreneurship education.

Important trend: client projects. Work-based learning has been growing with career academies and renewed interest in CTE. Six years ago, a subset of WBL called client-connected projects became a focal point of the Real World Learning initiative in Kansas City, where they are defined as authentic problems that students solve in collaboration with professionals from industry, not-for-profit, and community-based organizations… and allow students to: engage directly with employers, address real-world problems, and develop essential skills.


Portrait of a Community to Empower Learning Transformation — from gettingsmart.com by Rebecca Midles and Mason Pashia

Key Points

  • The Community Portrait approach encourages diverse voices to shape the future of education, ensuring it reflects the needs and aspirations of all stakeholders.
  • Active, representative community engagement is essential for creating meaningful and inclusive educational environments.

The Portrait of a Graduate—a collaborative effort to define what learners should know and be able to do upon graduation—has likely generated enthusiasm in your community. However, the challenge of future-ready graduates persists: How can we turn this vision into a reality within our diverse and dynamic schools, especially amid the current national political tensions and contentious curriculum debates?

The answer lies in active, inclusive community engagement. It’s about crafting a Community Portrait that reflects the rich diversity of our neighborhoods. This approach, grounded in the same principles used to design effective learning systems, seeks to cultivate deep, reciprocal relationships within the community. When young people are actively involved, the potential for meaningful change increases exponentially.


Q&A: Why Schools Must Redesign Learning to Include All Students — from edtechmagazine.com by Taashi Rowe
Systems are broken, not children, says K–12 disability advocate Lindsay E. Jones.

Although Lindsay E. Jones came from a family of educators, she didn’t expect that going to law school would steer her back into the family business. Over the years she became a staunch advocate for children with disabilities. And as mom to a son with learning disabilities and ADHD who is in high school and doing great, her advocacy is personal.

Jones previously served as president and CEO of the National Center for Learning Disabilities and was senior director for policy and advocacy at the Council for Exceptional Children. Today, she is the CEO at CAST, an organization focused on creating inclusive learning environments in K–12. EdTech: Focus on K–12 spoke with Jones about how digital transformation, artificial intelligence and visionary leaders can support inclusive learning environments.

Our brains are all as different as our fingerprints, and throughout its 40-year history, CAST has been focused on one core value: People are not broken, systems are poorly designed. And those systems are creating a barrier that holds back human innovation and learning.

 

Dream Machine is an AI model that makes high quality, realistic videos fast from text and images.

It is a highly scalable and efficient transformer model trained directly on videos, making it capable of generating physically accurate, consistent and eventful shots. Dream Machine is our first step towards building a universal imagination engine, and it is available to everyone now!



Text-to-Video Emergence for July 2024 — from ai-supremacy.com by Michael Spencer
Who needs Sora?

There have been some incredible teasers in the text-to-video arena of Generative AI. Namely I’m watching:


“OpenAI seems to have the ability to create video in Sora, send it to ChatGPT for a script, use Voice Engine for voice over and put it all together.”
— by u/MassiveWasabi in r/singularity

 

Daniel Christian: My slides for the Educational Technology Organization of Michigan’s Spring 2024 Retreat

From DSC:
Last Thursday, I presented at the Educational Technology Organization of Michigan’s Spring 2024 Retreat. I wanted to pass along my slides to you all, in case they are helpful to you.

Topics/agenda:

  • Topics & resources re: Artificial Intelligence (AI)
    • Top multimodal players
    • Resources for learning about AI
    • Applications of AI
    • My predictions re: AI
  • The powerful impact of pursuing a vision
  • A potential, future next-gen learning platform
  • Share some lessons from my past with pertinent questions for you all now
  • The significant impact of an organization’s culture
  • Bonus material: Some people to follow re: learning science and edtech

 

Education Technology Organization of Michigan -- ETOM -- Spring 2024 Retreat on June 6-7

PowerPoint slides of Daniel Christian's presentation at ETOM

Slides of the presentation (.PPTX)
Slides of the presentation (.PDF)

 


Plus several more slides re: this vision.

 

Introducing Stable Audio Open – An Open Source Model for Audio Samples and Sound Design — from stability.ai; via Rundown AI

Key Takeaways:

  • Stable Audio Open is an open source text-to-audio model for generating up to 47 seconds of samples and sound effects.
  • Users can create drum beats, instrument riffs, ambient sounds, foley and production elements.
  • The model enables audio variations and style transfer of audio samples.

Some comments from Rundown AI:

Why it matters: While the AI advances in text-to-image models have been the most visible (literally), both video and audio are about to take the same leap. Putting these tools in the hands of creatives will redefine traditional workflows — from musicians brainstorming new beats to directors crafting sound effects for film and TV.

 

Hybrid learning through podcasts: a practical approach — from timeshighereducation.com by Catherine Chambers
Adapting practice-based learning to a blend of synchronous and asynchronous delivery gives learners more control and creates opportunities for real-world learning of skills such as podcast production, writes Catherine Chambers

Hybrid learning provides students with greater control over their learning and enables the development of employability skills, supporting practice-based group work through in situ activities.

Aligned with Keele’s curriculum expectations framework, the module was designed around podcasts to support inclusivity, active learning, digital capability and external engagement.

 

LearnLM is Google's new family of models fine-tuned for learning, and grounded in educational research to make teaching and learning experiences more active, personal and engaging.

LearnLM is our new family of models fine-tuned for learning, and grounded in educational research to make teaching and learning experiences more active, personal and engaging.

.

 


AI in Education: Google’s LearnLM product has incredible potential — from ai-supremacy.com by Michael Spencer and Nick Potkalitsky
Google’s Ed Suite is giving Teachers new ideas for incorporating AI into the classroom.

We often talk about what Generative AI will do for coders, healthcare, science or even finance, but what about the benefits for the next generation? Permit me, if you will: here I’m thinking about teachers and students.

It’s no secret that some of the most active users of ChatGPT in its heyday were students. But how are other major tech firms thinking about this?

I actually think one of the best products with the highest ceiling from Google I/O 2024 is LearnLM. It has to be way more than a chatbot; it has to feel like a multimodal tutor. I can imagine frontier model agents (H) doing this fairly well.

What if everyone, everywhere could have their own personal AI tutor, on any topic?


ChatGPT4o Is the TikTok of AI Models — from nickpotkalitsky.substack.com by Nick Potkalitsky
In Search of Better Tools for AI Access in K-12 Classrooms

Nick makes the case that we should pause on the use of OpenAI in the classrooms:

In light of these observations, it’s clear that we must pause and rethink the use of OpenAI products in our classrooms, except for rare cases where accessibility needs demand it. The rapid consumerization of AI, epitomized by GPT4o’s transformation into an AI salesperson, calls for caution.


The Future of AI in Education: Google and OpenAI Strategies Unveiled — from edtechinsiders.substack.com by Ben Kornell

Google’s Strategy: AI Everywhere
Key Points

  • Google will win through seamless Gemini integration across all Google products
  • Enterprise approach in education to make Gemini the default at low/no additional cost
  • Functional use cases and model tuning demonstrate Google’s knowledge of educators

OpenAI’s Strategy: ChatGPT as the Front Door
Key Points

  • OpenAI taking a consumer-led freemium approach to education
  • API powers an app layer that delivers education-specific use cases
  • Betting on a large user base + app marketplace
 
 


Landscapes Radiate Light and Drama in Erin Hanson’s Vibrant Oil Paintings — from thisiscolossal.com by Kate Mothes and Erin Hanson

 

io.google/2024



How generative AI expands curiosity and understanding with LearnLM — from blog.google
LearnLM is our new family of models fine-tuned for learning, and grounded in educational research to make teaching and learning experiences more active, personal and engaging.

Generative AI is fundamentally changing how we’re approaching learning and education, enabling powerful new ways to support educators and learners. It’s taking curiosity and understanding to the next level — and we’re just at the beginning of how it can help us reimagine learning.

Today we’re introducing LearnLM: our new family of models fine-tuned for learning, based on Gemini.

On YouTube, a conversational AI tool makes it possible to figuratively “raise your hand” while watching academic videos to ask clarifying questions, get helpful explanations or take a quiz on what you’ve been learning. This even works with longer educational videos like lectures or seminars thanks to the Gemini model’s long-context capabilities. These features are already rolling out to select Android users in the U.S.

Learn About is a new Labs experience that explores how information can turn into understanding by bringing together high-quality content, learning science and chat experiences. Ask a question and it helps guide you through any topic at your own pace — through pictures, videos, webpages and activities — and you can upload files or notes and ask clarifying questions along the way.


Google I/O 2024: An I/O for a new generation — from blog.google

The Gemini era
A year ago on the I/O stage we first shared our plans for Gemini: a frontier model built to be natively multimodal from the beginning, that could reason across text, images, video, code, and more. It marks a big step in turning any input into any output — an “I/O” for a new generation.

In this story:


Daily Digest: Google I/O 2024 – AI search is here. — from bensbites.beehiiv.com
PLUS: It’s got Agents, Video and more. And, Ilya leaves OpenAI

  • Google is integrating AI into all of its ecosystem: Search, Workspace, Android, etc. In true Google fashion, many features are “coming later this year”. If they ship and perform like the demos, Google will get a serious upper hand over OpenAI/Microsoft.
  • All of the AI features across Google products will be powered by Gemini 1.5 Pro. It’s Google’s best model and one of the top models. A new Gemini 1.5 Flash model is also launched, which is faster and much cheaper.
  • Google has ambitious projects in the pipeline. Those include a real-time voice assistant called Astra, a long-form video generator called Veo, plans for end-to-end agents, virtual AI teammates and more.

 



New ways to engage with Gemini for Workspace — from workspace.google.com

Today at Google I/O we’re announcing new, powerful ways to get more done in your personal and professional life with Gemini for Google Workspace. Gemini in the side panel of your favorite Workspace apps is rolling out more broadly and will use the 1.5 Pro model for answering a wider array of questions and providing more insightful responses. We’re also bringing more Gemini capabilities to your Gmail app on mobile, helping you accomplish more on the go. Lastly, we’re showcasing how Gemini will become the connective tissue across multiple applications with AI-powered workflows. And all of this comes fresh on the heels of the innovations and enhancements we announced last month at Google Cloud Next.


Google’s Gemini updates: How Project Astra is powering some of I/O’s big reveals — from techcrunch.com by Kyle Wiggers

Google is improving its AI-powered chatbot Gemini so that it can better understand the world around it — and the people conversing with it.

At the Google I/O 2024 developer conference on Tuesday, the company previewed a new experience in Gemini called Gemini Live, which lets users have “in-depth” voice chats with Gemini on their smartphones. Users can interrupt Gemini while the chatbot’s speaking to ask clarifying questions, and it’ll adapt to their speech patterns in real time. And Gemini can see and respond to users’ surroundings, either via photos or video captured by their smartphones’ cameras.


Generative AI in Search: Let Google do the searching for you — from blog.google
With expanded AI Overviews, more planning and research capabilities, and AI-organized search results, our custom Gemini model can take the legwork out of searching.


 

Hello GPT-4o — from openai.com
We’re announcing GPT-4o, our new flagship model that can reason across audio, vision, and text in real time.

GPT-4o (“o” for “omni”) is a step towards much more natural human-computer interaction—it accepts as input any combination of text, audio, image, and video and generates any combination of text, audio, and image outputs. It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time in a conversation. It matches GPT-4 Turbo performance on text in English and code, with significant improvement on text in non-English languages, while also being much faster and 50% cheaper in the API. GPT-4o is especially better at vision and audio understanding compared to existing models.

Example topics covered here:

  • Two GPT-4os interacting and singing
  • Languages/translation
  • Personalized math tutor
  • Meeting AI
  • Harmonizing and creating music
  • Providing inflection, emotions, and a human-like voice
  • Understanding what the camera is looking at and integrating it into the AI’s responses
  • Providing customer service

With GPT-4o, we trained a single new model end-to-end across text, vision, and audio, meaning that all inputs and outputs are processed by the same neural network. Because GPT-4o is our first model combining all of these modalities, we are still just scratching the surface of exploring what the model can do and its limitations.
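
Since GPT-4o accepts text, audio, image, and video in the same request, a single user message can bundle text and an image together. The short sketch below (mine, not OpenAI’s; the prompt text and image URL are placeholders) shows the message structure the OpenAI Python SDK expects for mixed text-plus-vision input, with the actual network call left commented out since it requires an API key:

```python
# A minimal sketch of a multimodal GPT-4o request, using the message
# format from the OpenAI Python SDK. The prompt and image URL below are
# placeholders, not real assets.

def build_multimodal_message(text: str, image_url: str) -> dict:
    """Combine a text prompt and an image reference into one user
    message -- the structure GPT-4o accepts for mixed input."""
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": text},
            {"type": "image_url", "image_url": {"url": image_url}},
        ],
    }

message = build_multimodal_message(
    "What is shown in this picture?",
    "https://example.com/photo.jpg",  # placeholder image
)

# The actual call (requires OPENAI_API_KEY in the environment):
# from openai import OpenAI
# client = OpenAI()
# response = client.chat.completions.create(model="gpt-4o",
#                                           messages=[message])
# print(response.choices[0].message.content)

print(message["role"])          # user
print(len(message["content"]))  # 2 parts: text + image
```

The same request shape extends to audio-capable endpoints as they roll out; the point here is only that one model and one message carry all the modalities.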





From DSC:
I like the assistive tech angle here:

Microsoft’s new ChatGPT competitor… — from The Rundown AI

The Rundown: Microsoft is reportedly developing a massive 500B parameter in-house LLM called MAI-1, aiming to compete with top AI models from OpenAI, Anthropic, and Google.


2024 | The AI Founder Report | Business Impact, Use cases, & Tools — from Hampton; via The Neuron

Hampton runs a private community for high-growth tech founders and CEOs. We asked our community of founders and owners how AI has impacted their business and what tools they use.

Here’s a sneak peek of what’s inside:

  • The budgets they set aside for AI research and development
  • The most common (and obscure) tools founders are using
  • Measurable business impacts founders have seen through using AI
  • Where they are purposefully not using AI and much more

2024 Work Trend Index Annual Report from Microsoft and LinkedIn
AI at Work Is Here. Now Comes the Hard Part
Employees want AI; leaders are looking for a path forward.

Also relevant, see Microsoft’s web page on this effort:

To help leaders and organizations overcome AI inertia, Microsoft and LinkedIn looked at how AI will reshape work and the labor market broadly, surveying 31,000 people across 31 countries, identifying labor and hiring trends from LinkedIn, and analyzing trillions of Microsoft 365 productivity signals as well as research with Fortune 500 customers. The data points to insights every leader and professional needs to know—and actions they can take—when it comes to AI’s implications for work.

 

The Verge | What’s Next With AI | February 2024 | Consumer Survey

Microsoft AI creates talking deepfakes from single photo — from inavateonthenet.net


The Great Hall – where now with AI? It is not ‘Human Connection V Innovative Technology’ but ‘Human Connection + Innovative Technology’ — from donaldclarkplanb.blogspot.com by Donald Clark

The theme of the day was Human Connection V Innovative Technology. I see this a lot at conferences, setting up the human connection (social) against the machine (AI). I think this is ALL wrong. It is, and has always been, a dialectic: human connection (social) PLUS the machine. Everyone has a smartphone; most use it for work, comms and social media. The binary between human and tech has long disappeared.


Techno-Social Engineering: Why the Future May Not Be Human, TikTok’s Powerful ForYou Algorithm, & More — by Misha Da Vinci

Things to consider as you dive into this edition:

  • As we increasingly depend on technology, how is it changing us?
  • In the interaction between humans and technology, who is adapting to whom?
  • Is the technology being built for humans, or are we being changed to fit into tech systems?
  • As time passes, will we become more like robots or the AI models we use?
  • Over the next 30 years, as we increasingly interact with technology, who or what will we become?

 

Description:

I recently created an AI version of myself—REID AI—and recorded a Q&A to see how this digital twin might challenge me in new ways. The video avatar is generated by Hour One, its voice was created by Eleven Labs, and its persona—the way that REID AI formulates responses—is generated from a custom chatbot built on GPT-4 that was trained on my books, speeches, podcasts and other content that I’ve produced over the last few decades. I decided to interview it to test its capability and how closely its responses match—and test—my thinking. Then, REID AI asked me some questions on AI and technology. I thought I would hate this, but I’ve actually ended up finding the whole experience interesting and thought-provoking.


From DSC:
This ability to ask questions of a digital twin is very interesting when you think about it in terms of “interviewing” a historical figure. I believe character.ai provides this kind of thing, but I haven’t used it much.


 

AI RESOURCES AND TEACHING (Kent State University) — from aiadvisoryboards.wordpress.com

Kent State University offers valuable resources for educators interested in incorporating artificial intelligence (AI) into their teaching practices. The university recognizes that the rapid emergence of AI tools presents both challenges and opportunities in higher education.

The AI Resources and Teaching page provides educators with information and guidance on various AI tools and their responsible use within and beyond the classroom. The page covers different areas of AI application, including language generation, visuals, videos, music, information extraction, quantitative analysis, and AI syllabus language examples.


A Cautionary AI Tale: Why IBM’s Dazzling Watson Supercomputer Made a Lousy Tutor — from the74million.org by Greg Toppo
With a new race underway to create the next teaching chatbot, IBM’s abandoned 5-year, $100M ed push offers lessons about AI’s promise and its limits.

For all its jaw-dropping power, Watson the computer overlord was a weak teacher. It couldn’t engage or motivate kids, inspire them to reach new heights or even keep them focused on the material — all qualities of the best mentors.

It’s a finding with some resonance to our current moment of AI-inspired doomscrolling about the future of humanity in a world of ascendant machines. “There are some things AI is actually very good for,” Nitta said, “but it’s not great as a replacement for humans.”

His five-year journey to essentially a dead-end could also prove instructive as ChatGPT and other programs like it fuel a renewed, multimillion-dollar experiment to, in essence, prove him wrong.

To be sure, AI can do sophisticated things such as generating quizzes from a class reading and editing student writing. But the idea that a machine or a chatbot can actually teach as a human can, he said, represents “a profound misunderstanding of what AI is actually capable of.” 

Nitta, who still holds deep respect for the Watson lab, admits, “We missed something important. At the heart of education, at the heart of any learning, is engagement. And that’s kind of the Holy Grail.”

From DSC:
This is why the vision that I’ve been tracking and working on has always said that HUMAN BEINGS will be necessary — they are key to realizing this vision. Along these lines, here’s a relevant quote:

Another crucial component of a new learning theory for the age of AI would be the cultivation of “blended intelligence.” This concept recognizes that the future of learning and work will involve the seamless integration of human and machine capabilities, and that learners must develop the skills and strategies needed to effectively collaborate with AI systems. Rather than viewing AI as a threat to human intelligence, a blended intelligence approach seeks to harness the complementary strengths of humans and machines, creating a symbiotic relationship that enhances the potential of both.

Per Alexander “Sasha” Sidorkin, Head of the National Institute on AI in Society at California State University Sacramento.

 
© 2024 | Daniel Christian