Getty Images
Every Batman needs a Robin. To get ahead in an increasingly digital business world, today’s companies must know how to work with artificial intelligence technology. Quite often, digital technology is acknowledged as an important means to an end. In other words, it helps us to address a problem and provide a solution.
There’s nothing wrong with this, of course — in fact, we’ve witnessed a great deal of results from the use of everyday digital tools already. Think of automobile sensors that promptly give off a beep whenever cars switch lanes or automated business intelligence that quickly scans Excel and other reports.…Story continues…
Source: Meet Your Next Business Partner: Generative AI. Here’s Why You Should Team Up | Inc.com
.
Critics:
A generative AI system is constructed by applying unsupervised or self-supervised machine learning to a data set. The capabilities of a generative AI system depend on the modality or type of the data set used. Generative AI can be either unimodal or multimodal; unimodal systems take only one type of input, whereas multimodal systems can take more than one type of input. For example, one version of OpenAI‘s GPT-4 accepts both text and image inputs.
Generative AI systems trained on words or word tokens include GPT-3, LaMDA, LLaMA, BLOOM, GPT-4, and others (see List of large language models). They are capable of natural language processing, machine translation, and natural language generation and can be used as foundation models for other tasks. Data sets include BookCorpus, Wikipedia, and others (see List of text corpora).
In addition to natural language text, large language models can be trained on programming language text, allowing them to generate source code for new computer programs.[38] Examples include OpenAI Codex. Producing high-quality visual art is a prominent application of generative AI. Many such artistic works have received public awards and recognition.
Generative AI systems trained on sets of images with text captions include Imagen, DALL-E, Midjourney, Adobe Firefly, Stable Diffusion and others (see Artificial intelligence art, Generative art, and Synthetic media). They are commonly used for text-to-image generation and neural style transfer. Datasets include LAION-5B and others (See List of datasets in computer vision and image processing).
Generative AI can also be trained extensively on audio clips to produce natural-sounding speech synthesis and text-to-speech capabilities, exemplified by ElevenLabs‘ context-aware synthesis tools or Meta Platform‘s Voicebox. Generative AI systems such as MusicLM and MusicGen can also be trained on the audio waveforms of recorded music along with text annotations, in order to generate new musical samples based on text descriptions such as a calming violin melody backed by a distorted guitar riff.
Generative AI trained on annotated video can generate temporally-coherent video clips. Examples include Gen-1 and Gen-2 by Runway and Make-A-Video by Meta Platforms. Generative AI systems can be trained on sequences of amino acids or molecular representations such as SMILES representing DNA or proteins. These systems, such as AlphaFold, are used for protein structure prediction and drug discovery. Datasets include various biological datasets.
Generative AI can also be trained on the motions of a robotic system to generate new trajectories for motion planning or navigation. For example, UniPi from Google Research uses prompts like “pick up blue bowl” or “wipe plate with yellow sponge” to control movements of a robot arm. Multimodal “vision-language-action” models such as Google’s RT-2 can perform rudimentary reasoning in response to user prompts and visual input, such as picking up a toy dinosaur when given the prompt pick up the extinct animal at a table filled with toy animals and other objects.
The terms generative AI planning or generative planning were used in the 1980s and 1990s to refer to AI planning systems, especially computer-aided process planning, used to generate sequences of actions to reach a specified goal. Generative AI planning systems used symbolic AI methods such as state space search and constraint satisfaction and were a “relatively mature” technology by the early 1990s. They were used to generate crisis action plans for military use, process plans for manufacturing and decision plans such as in prototype autonomous spacecraft.
In recent developments within the field of generative artificial intelligence, the concept of “Generative Business Intelligence (BI)”has emerged as a notable application. Generative BI refers to the use of generative AI techniques to enhance business intelligence and analytics, enabling more advanced data interpretation and decision-making processes. This approach leverages the generative capabilities of AI to simulate potential business scenarios and outcomes, providing valuable insights for strategic planning.
Generative AI models are used to power chatbot products such as ChatGPT, programming tools such as GitHub Copilot, text-to-image products such as Midjourney, and text-to-video products such as Runway Gen-2. Generative AI features have been integrated into a variety of existing commercially available products such as Microsoft Office, Google Photos, and Adobe Photoshop.
Many generative AI models are also available as open-source software, including Stable Diffusion and the LLaMA language model. Smaller generative AI models with up to a few billion parameters can run on smartphones, embedded devices, and personal computers. For example, LLaMA-7B (a version with 7 billion parameters) can run on a Raspberry Pi 4 and one version of Stable Diffusion can run on an iPhone 11.
Larger models with tens of billions of parameters can run on laptop or desktop computers. To achieve an acceptable speed, models of this size may require accelerators such as the GPU chips produced by Nvidia and AMD or the Neural Engine included in Apple silicon products. For example, the 65 billion parameter version of LLaMA can be configured to run on a desktop PC.
Language models with hundreds of billions of parameters, such as GPT-4 or PaLM, typically run on datacenter computers equipped with arrays of GPUs (such as Nvidia’s H100) or AI accelerator chips (such as Google’s TPU). These very large models are typically accessed as cloud services over the Internet. In 2022, the United States New Export Controls on Advanced Computing and Semiconductors to China imposed restrictions on exports to China of GPU and AI accelerator chips used for generative AI.
Chips such as the Nvidia A800 and the Biren Technology BR104 were developed to meet the requirements of the sanctions. There is free software on the market capable of recognizing text generated by generative artificial intelligence (such as GPTZero), as well as images, audio or video coming from it. The impact of AI on numerous industries has been profound, revolutionizing productivity, decision-making processes, and customer experiences.
However, amidst this progress, challenges and concerns have emerged.The development of generative AI has raised concerns from governments, businesses, and individuals, resulting in protests, legal actions, calls to pause AI experiments, and actions by multiple governments. In a July 2023 briefing of the United Nations Security Council, Secretary-General António Guterres stated “Generative AI has enormous potential for good and evil at scale”.
That AI may “turbocharge global development” and contribute between $10 and $15 trillion to the global economy by 2030, but that its malicious use “could cause horrific levels of death and destruction, widespread trauma, and deep psychological damage on an unimaginable scale”.
From the early days of the development of AI, there have been arguments put forward by ELIZA creator Joseph Weizenbaum and others about whether tasks that can be done by computers actually should be done by them, given the difference between computers and humans, and between quantitative calculations and qualitative, value-based judgements. In April 2023, it was reported that image generation AI has resulted in 70% of the jobs for video game illustrators in China being lost.
In July 2023, developments in generative AI contributed to the 2023 Hollywood labor disputes. Fran Drescher, president of the Screen Actors Guild, declared that “artificial intelligence poses an existential threat to creative professions” during the 2023 SAG-AFTRA strike. Voice generation AI has been seen as a potential challenge to the voice acting sector. The intersection of AI and employment concerns among underrepresented groups globally remains a critical facet.
While AI promises efficiency enhancements and skill acquisition, concerns about job displacement and biased recruiting processes persist among these groups, as outlined in surveys by Fast Company. To leverage AI for a more equitable society, proactive steps encompass mitigating biases, advocating transparency, respecting privacy and consent, and embracing diverse teams and ethical considerations.
Strategies involve redirecting policy emphasis on regulation, inclusive design, and education’s potential for personalized teaching to maximize benefits while minimizing harms. In the financial realm, significant investment surges, as highlighted in discussions by Daron Acemoglu, have led to transformative tools like robo-advisors, reshaping traditional financial practices. Acemoglu’s warnings about potential adverse societal consequences stemming from AI, particularly in data harvesting, customer manipulation, and labor market disparities, underscore the complexities of AI’s impact on society.
AI’s integration with social identities, elucidated by Marcin Frackiewiczin, holds both promises and challenges. AI’s ability to transform traditional research methods, unveiling nuanced patterns within Social identity realms, has immense potential. However, biases ingrained in AI systems perpetuate stereotypes and marginalize groups, emphasizing the critical need to address these biases for inclusivity.
Deepfakes (a portmanteau of “deep learning” and “fake”) are AI-generated media that take a person in an existing image or video and replace them with someone else’s likeness using artificial neural networks. Deepfakes have garnered widespread attention and concerns for their uses in deepfake celebrity pornographic videos, revenge porn, fake news, hoaxes, and financial fraud. This has elicited responses from both industry and government to detect and limit their use.
Instances of users abusing software to generate controversial statements in the vocal style of celebrities, public officials, and other famous individuals have raised ethical concerns over voice generation AI. In response, companies such as ElevenLabs have stated that they would work on mitigating potential abuse through safeguards and identity verification. Concerns and fandom have spawned from AI generated music. The same software used to clone voices has been used on famous musicians’ voices to create songs that mimic their voices, gaining both tremendous popularity and criticism.
Similar techniques have also been used to create improved quality or full-length versions of songs that have been leaked or have yet to be released. Generative AI has also been used to create new digital artist personalities, with some of these receiving enough attention to receive record deals at major labels. The developers of these virtual artists have also faced their fair share of criticism for their personified programs, including backlash for “dehumanizing” an artform, and also creating artists which create unrealistic or immoral appeals to their audiences.
Generative AI’s ability to create realistic fake content has been exploited in numerous types of cybercrime, including phishing scams. Deepfake video and audio have been used to create disinformation and fraud. Former Google fraud czar Shuman Ghosemajumder has predicted that while deepfake videos initially created a stir in the media, they would soon become commonplace, and as a result, more dangerous. Additionally, large-language models and other forms of text-generation AI have been at a broad scale to create fake reviews on e-commerce websites to boost ratings.
Cybercriminals have created large language models focused on fraud, including WormGPT and FraudGPT. Recent research done in 2023 has revealed that generative AI has weaknesses that can be manipulated by criminals to extract harmful information bypassing ethical safeguards. The study presents example attacks done on ChatGPT including Jailbreaks and reverse psychology. Additionally, malicious individuals can use ChatGPT for social engineering attacks and phishing attacks, revealing the harmful side of these technologies.
In January 2023, Futurism.com broke the story that CNET had been using an undisclosed internal AI tool to write at least 77 of its stories; after the news broke, CNET posted corrections to 41 of the stories. In April 2023, the German tabloid Die Aktuelle published a fake AI-generated interview with former racing driver Michael Schumacher, who had not made any public appearances since 2013 after sustaining a brain injury in a skiing accident. The story included two possible disclosures: the cover included the line “deceptively real”, and the interview included an acknowledgment at the end that it was AI-generated. The editor-in-chief was fired shortly thereafter amid the controversy.
In the European Union, the proposed Artificial Intelligence Act includes requirements to disclose copyrighted material used to train generative AI systems, and to label any AI-generated output as such. In the United States, a group of companies including OpenAI, Alphabet, and Meta signed a voluntary agreement with the White House in July 2023 to watermark AI-generated content. In China, the Interim Measures for the Management of Generative AI Services introduced by the Cyberspace Administration of China regulates any public-facing generative AI.
It includes requirements to watermark generated images or videos, regulations on training data and label quality, restrictions on personal data collection, and a guideline that generative AI must “adhere to socialist core values”.
Newsom, Gavin; Weber, Shirley N. (September 6, 2023). “Executive Order N-12-23” (PDF). Executive Department, State of California. Retrieved September 7, 2023.
^ Jump up to:a b Griffith, Erin; Metz, Cade (January 27, 2023). “Anthropic Said to Be Closing In on $300 Million in New A.I. Funding”. The New York Times. Retrieved March 14, 2023.
^ Lanxon, Nate; Bass, Dina; Davalos, Jackie (March 10, 2023). “A Cheat Sheet to AI Buzzwords and Their Meanings”. Bloomberg News. Retrieved March 14, 2023.
^ Pinaya, Walter H. L.; Graham, Mark S.; Kerfoot, Eric; Tudosiu, Petru-Daniel; Dafflon, Jessica; Fernandez, Virginia; Sanchez, Pedro; Wolleb, Julia; da Costa, Pedro F.; Patel, Ashay (2023). “Generative AI for Medical Imaging: extending the MONAI Framework”. arXiv:2307.15208 [eess.IV].
^ Pasick, Adam (March 27, 2023). “Artificial Intelligence Glossary: Neural Networks and Other Terms Explained”. The New York Times. ISSN 0362-4331. Retrieved April 22, 2023.
^ Karpathy, Andrej; Abbeel, Pieter; Brockman, Greg; Chen, Peter; Cheung, Vicki; Duan, Yan; Goodfellow, Ian; Kingma, Durk; Ho, Jonathan; Rein Houthooft; Tim Salimans; John Schulman; Ilya Sutskever; Wojciech Zaremba (June 16, 2016). “Generative models”. OpenAI.
^ Metz, Cade (March 14, 2023). “OpenAI Plans to Up the Ante in Tech’s A.I. Race”. The New York Times. ISSN 0362-4331. Retrieved March 31, 2023.
^ Thoppilan, Romal; De Freitas, Daniel; Hall, Jamie; Shazeer, Noam; Kulshreshtha, Apoorv (January 20, 2022). “LaMDA: Language Models for Dialog Applications”. arXiv:2201.08239 [cs.CL].
^ Roose, Kevin (October 21, 2022). “A Coming-Out Party for Generative A.I., Silicon Valley’s New Craze”. The New York Times. Retrieved March 14, 2023.
^ “Don’t fear an AI-induced jobs apocalypse just yet”. The Economist. March 6, 2023. Retrieved March 14, 2023.
^ Harreis, H.; Koullias, T.; Roberts, Roger. “Generative AI: Unlocking the future of fashion”.
^ “How Generative AI Can Augment Human Creativity”. Harvard Business Review. June 16, 2023. ISSN 0017-8012. Retrieved June 20, 2023.
^ “The race of the AI labs heats up”. The Economist. January 30, 2023. Retrieved March 14, 2023.
^ Yang, June; Gokturk, Burak (March 14, 2023). “Google Cloud brings generative AI to developers, businesses, and governments”.
^ Hendrix, Justin (May 16, 2023). “Transcript: Senate Judiciary Subcommittee Hearing on Oversight of AI”. techpolicy.press. Retrieved May 19, 2023.
^ Simon, Felix M.; Altay, Sacha; Mercier, Hugo (October 18, 2023). “Misinformation reloaded? Fears about the impact of generative AI on misinformation are overblown”. Harvard Kennedy School Misinformation Review. doi:10.37016/mr-2020-127. S2CID 264113883.
^ Crevier, Daniel (1993). AI: The Tumultuous Search for Artificial Intelligence. New York, New York: BasicBooks. p. 109. ISBN 0-465-02997-3.
^ Newquist, H. P. (1994). The Brain Makers: Genius, Ego, And Greed In The Quest For Machines That Think. New York: Macmillan/SAMS. pp. 45–53. ISBN 978-0-672-30412-5.
^ Sharkey, Noel (July 4, 2007), A programmable robot from 60 AD, vol. 2611, New Scientist, archived from the original on January 13, 2018, retrieved October 22, 2019
^ Brett, Gerard (July 1954), “The Automata in the Byzantine “Throne of Solomon””, Speculum, 29 (3): 477–487, doi:10.2307/2846790, ISSN 0038-7134, JSTOR 2846790, S2CID 163031682.
^ kelinich (March 8, 2014). “Maillardet’s Automaton”. The Franklin Institute. Retrieved August 24, 2023.
^ SITNFlash (August 28, 2017). “The History of Artificial Intelligence”. Science in the News. Retrieved December 8, 2023.
^ Bergen, Nathan; Huang, Angela (2023). “A Brief History of Generative AI” (PDF). Dichotomies: Generative AI: Navigating Towards a Better Future (2): 4.
^ Gagniuc, Paul A. (2017). Markov Chains: From Theory to Implementation and Experimentation. USA, New Jersey: John Wiley & Sons. pp. 2–8. ISBN 978-1-119-38755-8.
^ Grinstead, Charles Miller; Snell, James Laurie (1997). Introduction to Probability. American Mathematical Society. pp. 464–466. ISBN 978-0-8218-0749-1.
^ Bremaud, Pierre (March 9, 2013). Markov Chains: Gibbs Fields, Monte Carlo Simulation, and Queues. Springer Science & Business Media. p. ix. ISBN 978-1-4757-3124-8. Archived from the original on March 23, 2017.
^ Hayes, Brian (2013). “First Links in the Markov Chain”. American Scientist. 101 (2): 92. doi:10.1511/2013.101.92. ISSN 0003-0996.
^ Fine, Shai; Singer, Yoram; Tishby, Naftali (July 1, 1998). “The Hierarchical Hidden Markov Model: Analysis and Applications”. Machine Learning. 32 (1): 41–62. doi:10.1023/A:1007469218079. ISSN 1573-0565. S2CID 3465810.
^ Jebara, Tony (2012). Machine learning: discriminative and generative. Vol. 755. Springer Science & Business Media.
^ Cao, Yihan; Li, Siyu; Liu, Yixin; Yan, Zhiling; Dai, Yutong; Yu, Philip S.; Sun, Lichao (March 7, 2023). “A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT”. ArXiv. doi:10.48550/arXiv.2303.04226.
^ “finetune-transformer-lm”. GitHub. Retrieved May 19, 2023.
^ Radford, Alec; Wu, Jeffrey; Child, Rewon; Luan, David; Amodei, Dario; Sutskever, Ilya; others (2019). “Language models are unsupervised multitask learners”. OpenAI Blog. 1 (8): 9.
^ Bubeck, Sébastien; Chandrasekaran, Varun; Eldan, Ronen; Gehrke, Johannes; Horvitz, Eric; Kamar, Ece; Lee, Peter; Lee, Yin Tat; Li, Yuanzhi; Lundberg, Scott; Nori, Harsha; Palangi, Hamid; Ribeiro, Marco Tulio; Zhang, Yi (March 22, 2023). “Sparks of Artificial General Intelligence: Early experiments with GPT-4”. arXiv:2303.12712 [cs.CL].
^ Schlagwein, Daniel; Willcocks, Leslie (September 13, 2023). “ChatGPT et al: The Ethics of Using (Generative) Artificial Intelligence in Research and Science”. Journal of Information Technology. 38 (2): 232–238. doi:10.1177/02683962231200411. S2CID 261753752.
^ “A History of Generative AI: From GAN to GPT-4”. March 21, 2023.
^ “Explainer: What is Generative AI, the technology behind OpenAI’s ChatGPT?”. Reuters. March 17, 2023. Retrieved March 17, 2023.
^ Bommasani, R.; Hudson, D. A.; Adeli, E.; Altman, R.; Arora, S.; von Arx, S.; Bernstein, M. S.; Bohg, J.; Bosselut, A; Brunskill, E.; Brynjolfsson, E. (August 16, 2021). “On the opportunities and risks of foundation models”. arXiv:2108.07258 [cs.LG].
^ Chen, Ming; Tworek, Jakub; Jun, Hongyu; Yuan, Qinyuan; Pinto, Hanyu Philippe De Oliveira; Kaplan, Jerry; Edwards, Haley; Burda, Yannick; Joseph, Nicholas; Brockman, Greg; Ray, Alvin (July 6, 2021). “Evaluating Large Language Models Trained on Code”. arXiv:2107.03374 [cs.LG].
^ Epstein, Ziv; Hertzmann, Aaron; Akten, Memo; Farid, Hany; Fjeld, Jessica; Frank, Morgan R.; Groh, Matthew; Herman, Laura; Leach, Neil; Mahari, Robert; Pentland, Alex “Sandy”; Russakovsky, Olga; Schroeder, Hope; Smith, Amy (2023). “Art and the science of generative AI”. Science. 380 (6650): 1110–1111. arXiv:2306.04141. Bibcode:2023Sci…380.1110E. doi:10.1126/science.adh4451. PMID 37319193. S2CID 259095707.
^ Ramesh, Aditya; Pavlov, Mikhail; Goh, Gabriel; Gray, Scott; Voss, Chelsea; Radford, Alec; Chen, Mark; Sutskever, Ilya (2021). “Zero-shot text-to-image generation”. International Conference on Machine Learning. PMLR. pp. 8821–8831.
^ Desai, Saahil (July 17, 2023). “A Voicebot Just Left Me Speechless”. The Atlantic. Retrieved November 28, 2023.
^ Agostinelli, Andrea; Denk, Timo I.; Borsos, Zalán; Engel, Jesse; Verzetti, Mauro; Caillon, Antoine; Huang, Qingqing; Jansen, Aren; Roberts, Adam; Tagliasacchi, Marco; Sharifi, Matt; Zeghidour, Neil; Frank, Christian (January 26, 2023). “MusicLM: Generating Music From Text”. arXiv:2301.11325 [cs.SD].
^ Dalugdug, Mandy (August 3, 2023). “Meta in June said that it used 20,000 hours of licensed music to train MusicGen, which included 10,000 “high-quality” licensed music tracks. At the time, Meta’s researchers outlined in a paper the ethical challenges that they encountered around the development of generative AI models like MusicGen”.
^ Metz, Cade (April 4, 2023). “Instant Videos Could Represent the Next Leap in A.I. Technology”. The New York Times.
^ Wong, Queenie (September 29, 2022). “Facebook Parent Meta’s AI Tool Can Create Artsy Videos From Text”. cnet.com. Retrieved April 4, 2023.
^ Heaven, Will Douglas (February 15, 2023). “AI is dreaming up drugs that no one has ever seen. Now we’ve got to see if they work”. MIT Technology Review. Massachusetts Institute of Technology. Retrieved March 15, 2023.
^ Yang, Sherry; Du, Yilun (April 12, 2023). “UniPi: Learning universal policies via text-guided video generation”. Google Research, Brain Team. Google AI Blog.
^ Brohan, Anthony (2023). “RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control”. arXiv:2307.15818 [cs.RO].
^ Jump up to:a b Alting, Leo; Zhang, Hongchao (1989). “Computer aided process planning: the state-of-the-art survey”. The International Journal of Production Research. 27 (4): 553–585. doi:10.1080/00207548908942569.
^ Chien, Steve (1998). “Automated planning and scheduling for goal-based autonomous spacecraft”. IEEE Intelligent Systems and Their Applications. 13 (5): 50–55. doi:10.1109/5254.722362.
^ Burstein, Mark H., ed. (1994). ARPA/Rome Laboratory Knowledge-based Planning and Scheduling Initiative Workshop Proceedings. The Advanced Research Projects Agency, Department of Defense, and Rome Laboratory, US Air Force, Griffiss AFB. p. 219. ISBN 155860345X.