Protecting your website with ChatGPT

TL:DR – I noticed someone attempt to use access a file called c99.php on one of my websites. It was blocked by the server, but I wanted to know more. ChatGPT explained that it was a hacking tool.


I was seeing some unusual activity in the logs for one of my websites, so I gave this prompt to ChatGPT, the language model, to see if it could offer an explanation:

You are to act as a web and programming expert with experience of PHP, WordPress, MySQL, and Apache. I noticed a visit to my site that was attempting to open a file called c99.php, is that file a hacking “backdoor” of some sort. It does not appear to exist on my server and if I try to open it, I get the following message in my web browser “This site can’t be reached, site unexpectedly closed the connection.”

Yes, c99.php is a commonly used backdoor file by hackers. It is usually uploaded to a vulnerable website to provide remote access and control to the hacker. The file name c99.php is derived from the popular web shell called “c99 shell”. This shell provides an easy-to-use interface for a hacker to access and manipulate files, databases, and other resources on the compromised server.

If you have noticed a request for c99.php on your website, it is likely that your website is being targeted by hackers. However, it’s good to note that this does not necessarily mean your site has already been compromised. It is possible that the hacker was simply scanning your site for vulnerabilities.

If the file does not exist on your server and you are unable to access it, it’s possible that your server’s security measures are preventing it from being uploaded or accessed. It is recommended that you take additional security measures to protect your website and server, such as regularly updating your software, using strong passwords, and implementing security plugins. It’s also a good idea to review your server logs to identify any suspicious activity and to report it to your hosting provider.

I then asked ChatGPT for advice on securing my site:

If you have identified any suspicious files on your server or if you suspect that your server has been compromised, it is important to take immediate action to secure your website and server. Here are some steps you can take to secure your server:

  1. Update your software: Make sure that your website software, including the CMS (e.g., WordPress), plugins, and themes, are up-to-date with the latest security patches. This will help to close any known vulnerabilities that could be exploited by hackers.
  2. Use strong passwords: Ensure that you and your users are using strong passwords that are not easy to guess. Use a mix of uppercase and lowercase letters, numbers, and symbols, and avoid using the same password across multiple accounts.
  3. Install security plugins: Install a security plugin such as Wordfence or Sucuri to monitor your website for suspicious activity, block malicious IPs, and detect and remove malware.
  4. Backup your data: Regularly backup your website files and databases so that you can restore your website quickly in case of a security breach or data loss.
  5. Limit user access: Only grant access to users who need it, and make sure that each user has the appropriate level of access. Use strong passwords, and consider implementing two-factor authentication for additional security.
  6. Use HTTPS: Enable HTTPS on your website to encrypt data transmitted between your website and users, preventing sensitive information from being intercepted by hackers.
  7. Contact your hosting provider: If you suspect that your server has been compromised, contact your hosting provider immediately. They may be able to help you identify and resolve the issue.

Given that all such security recommendations was already implemented on the WordPress side, I asked what might be done at teh server level via cPanel:

  1. Keep your cPanel account secure: Use strong passwords, and avoid using the same password across multiple accounts. Enable two-factor authentication if available. Also, make sure to log out of your cPanel account when you’re finished.
  2. Use an SSL certificate: Install an SSL certificate on your website to encrypt data transmitted between your website and users. This will help to prevent sensitive information from being intercepted by hackers.
  3. Monitor server logs: Check your server logs regularly to identify any suspicious activity. You can use tools such as AWStats or Webalizer to analyze your server logs and get insights into your website traffic.
  4. Enable ModSecurity: ModSecurity is a web application firewall that can help protect your website from common web-based attacks. It’s available in cPanel, and you can enable it under the “Security” section.
  5. Disable unused services: Disable any unused services in cPanel to reduce the attack surface of your server. For example, if you’re not using FTP, disable the FTP service.
  6. Install antivirus software: Install antivirus software on your server to scan for and remove any malware or viruses. There are several antivirus solutions available for cPanel, such as ClamAV.
  7. Backup your data: Regularly backup your website files and databases so that you can restore your website quickly in case of a security breach or data loss.

These are just a few suggestions for improving the security of your website and server. It’s important to note that some of these steps may require root access to the server, which may not be available on a shared hosting plan. However, your hosting provider may offer additional security features that you can take advantage of. It’s always a good idea to check with your hosting provider for specific recommendations on how to secure your server.

How to get ChatGPT to fix your website

TL:DR – Given an appropriate prompt, ChatGPT can write computer code to solve website problems…and more.


I’ve been doing web stuff since 1995, but sometimes running several sites as subdomains I come unstuck with redirects. Change something on one and it makes the same change inadvertently happen on another. It happened today, I wanted to change the way the RSS feeds work for the Sciencebase site, but the changes I made to the .htaccess file in the domain root kept b0rking another of my sites that’s on the same server as a sub-domain.

The problem was that I was getting scraped at the page level by various bots but individual pages on my site don’t have feeds as comments are disabled. There is one main RSS feed for all updates and separate ones for each category and each tag. I tried to fix it, but it was then redirecting the feed for one of my other sites to the main Sciencebase feed.

So I asked ChatGPT to help. I gave it this prompt to begin with:

You are an expert programmer with Apache knowledge, knowledge of redirects and the file .htaccess. Write a code to redirect all individual RSS feed pages generated by a website’s content management system (CMS) to the primary RSS for the site. I do not want you to redirect the RSS feeds assigned to individual category or tag pages.

It replied as follows:

Below the code snippet it added an explanation for each line, which is useful. That didn’t quite work how I was hoping it would, so I spelled out an example of what I wanted to happen:

The site is https://www.sciencebase.com/science-blog An example URL is https://www.sciencebase.com/science-blog/an-artificial-album-cover.html I want to redirect any URLs of this type https://www.sciencebase.com/science-blog/an-artificial-album-cover.html/feed But not any in a category or tag URL They should go to the primary RSS URL for https://www.sciencebase.com/science-blog/

It then gave me a revised version of the code, which worked fine, although needed an additional tweak to block the interfering with the other website problem. It took about ten minutes all in. I could’ve spent an hour or more on a search engine trying to home in a page discussing the exact same scenario. I think it’s all working cleanly now.

Updates and an artificial album cover

TL:DR – Updating Sciencebase systems and using MidJourney to generate some intriguing artwork to illustrate a blog post.


As regular readers will know, I’ve been running the Sciencebase site since July 1999. Its precursor, Elemental Discoveries, had various homes on the web from December 1995 until that fateful summer. There are almost 4000 articles in the archives, so it’s quite a hefty site for a one-man show.

AI image of a boy staring at a television, the image is reminiscent of a 1980s Rush album cover, specifically Power Windows

Anyway, having played around with various website options in the last couple of weeks, I’ve finally done a proper spring clean, got rid of some very out-of-date articles and updated others that were worth keeping. I’ve upgraded security and performance stuff so the site should load much faster than ever before but also present no security problems to visitors (I don’t think it ever did, to be honest).

I’ve also now added a new and improved search box. I’ve also made search easier for mobile users who can now access the search box from the menu rather than guessing where it might be and having to scroll to the bottom of the page. The new search box also anticipates what it is you might be searching for. So, if you start typing “avian” it might assume you’re after information on “avian influenza” of which there is quite a lot in the Sciencebase archives.

All in, I hope the site is now offering an improved experience for the 3000 or so unique visitors the site gets every day!

I also wanted an illustration for this blog post about all the updates and so prompted MidJourney with the spider web emoji to allude to the world wide web and websites. It came back with four images, as it does. Two were odd superhero, Spider Man type images, one was rather eclectic and showed a girl staring through the window of a house at raging fire in the middle of the room.

The other seemed much more apt. It purportedly showed a young boy in a red hoodie with his back to us presumably staring at a large, old-fashioned television screen. A child dreaming of a future, perhaps? It also struck me as being rather of the style of artist Hugh Syme who worked on album covers for the Canadian rock band Rush for decades.

Indeed, this generated boy watching TV image might almost have been an outtake from the designs for the band’s Power Windows album which also features a boy and television screens. I might use it for some of my music output at some point, but have annotated it for now with my name and the word SCIENCEBASE.

A ChatGPT glossary

TL:DR – A number of people were reaching Sciencebase searching for the phrase ChatGPT beam search so I asked ChatGPT itself to give me a definition. I then asked it to create a glossary of other common terms associated with Generative pre-trained transformers, GPTs.


ChatGPT, the language model, is essentially an incredibly sophisticated autocomplete program that responds to text prompts by predicting what might come next. It’s worth noting that it’s commonly referred to as artificial intelligence, AI. It is not AI. Here’s its response to my prompt asking it for a definition of ChatGPT beam search:

Beam search is a search algorithm used in Natural Language Processing (NLP) to find the most likely sequence of words or phrases that represent the best possible solution, given a set of choices.

In simpler terms, when trying to predict the next word or sequence of words in a sentence, there are often multiple possibilities that could be correct. The beam search algorithm helps to narrow down the number of options and find the most likely option based on the context and probability of each word or phrase.

The “beam” in beam search refers to the number of possible options that the algorithm keeps in consideration at each step of the search. By limiting the number of options to a certain “beam width”, the algorithm can efficiently explore the search space and find the best solution more quickly than exhaustively considering all possible options.


I then asked it to generate a glossary of the most important terms in discussions of ChatGPT:

Generative pre-trained transformer, GPT: GPT is a type of language model developed by OpenAI that uses deep learning techniques to generate natural language text. The GPT model is based on a transformer architecture, which was introduced in a paper by Vaswani et al. in 2017, pre-training involves exposing the system to a large amount of written text so that it “learns” the meaning and context of different words and phrases.

Natural Language Processing (NLP): A field of study that focuses on teaching computers to understand, analyze, and generate human language.

Neural Networks: A type of machine learning model that is inspired by the structure and function of the human brain.

Recurrent Neural Networks (RNN): A type of neural network that is specifically designed to handle sequential data, such as text.

Long Short-Term Memory (LSTM): A type of RNN that is designed to better handle long-term dependencies in sequential data.

Attention Mechanism: A technique used in NLP that allows models to focus on specific parts of the input sequence when making predictions.

Transformer: A type of neural network architecture that uses self-attention mechanisms to process input sequences in parallel.

BERT: Bidirectional Encoder Representations from Transformers. A transformer-based neural network architecture that is designed to better understand the context and meaning of words in a sentence by processing them in both directions.

GPT-3: Generative Pre-trained Transformer 3. A large-scale language model developed by OpenAI that uses a transformer architecture and is capable of generating human-like text.

GPT-4: The latest version of the language model from OpenAI as of March 2023. This version can handle around 25000 words at a time, that’s about eight times as many as GPT-3 can cope with.

Pretraining: The process of training a model on a large amount of data to learn general features of a language or domain.

Domain: A specific topic or subject area on which a text or dataset is focused.

Fine-tuning: The process of adapting a pre-trained model to a specific task or domain by training it on a smaller, task-specific dataset.

Fine-tuning strategies: Various techniques used to fine-tune pre-trained language models, like GPT, for specific NLP tasks. Examples include task-specific architectures, adding domain-specific data, and adjusting learning rates.

Language Modelling: The task of predicting the probability of the next word in a sentence or sequence of words.

Text Generation: The task of generating new text that is similar in style and structure to a given input text.

Transformer-based language models: A type of language model, like GPT, that uses the transformer architecture to process input sequences.

Generative models: Machine learning models, like GPT, that can generate new output based on input data.

Sampling: The process of randomly selecting words or phrases from a language model, like GPT, to generate new text.

Top-p sampling: A type of sampling that selects from the most probable words or phrases in a language model, like GPT, until a certain probability threshold is reached.

Top-k sampling: A type of sampling that selects from the top k most probable words or phrases in a language model, like GPT.

Perplexity: A metric used to evaluate the performance of a language model, like GPT, by measuring how well it predicts new text.

Overfitting: A situation where a model, like GPT, performs well on training data but poorly on new data due to being too closely fitted to the training data.

Transfer learning: A technique in machine learning where a pre-trained model is used as a starting point for a new task or dataset, in order to improve the performance and efficiency of training.

Language transfer learning: A type of machine learning that involves training models, like GPT, on one language or domain and transferring that knowledge to a different language or domain.

Unsupervised learning: A type of machine learning that involves training models, like GPT, on unlabeled data, without the need for explicit supervision or labeling of input data.

Input sequence: The text or sequence of words that is fed into a language model, like GPT, to generate output.

Embeddings: A vector representation of words or phrases used by language models, like GPT, to process input and generate output.

End-to-end training: A type of training where a model, like GPT, is trained to perform a specific task from start to finish without any intermediate steps or processing.

Dialogue generation: The production of responses to a given dialogue context. This is an important application of language models like GPT, and it involves training models to generate coherent and relevant responses to a user’s input in a conversation.

Multi-modal learning: A type of machine learning that involves processing and integrating information from multiple modalities, such as text, images, and audio.

Inference: The process of using a trained model, like GPT, to generate output based on input data.

Domain adaptation: A type of transfer learning that involves adapting a pre-trained model, like GPT, to a specific domain or task by fine-tuning it on a smaller, task-specific dataset.

Hyperparameters: Parameters in a machine learning model, like GPT, that are set by the user and affect the model’s behavior, such as the learning rate or batch size. These are typically tuned through experimentation to optimize model performance.

Knowledge graph: A type of graph database that stores information in a structured format, representing relationships between entities and their attributes. This is often used to extract and store structured data from unstructured text, and can be used as a knowledge base to inform language models like GPT.

Text classification: The task of categorizing text into one or more predefined classes or categories. This is an important application of NLP, and can be used to classify documents, emails, social media posts, and other types of text data.

Named entity recognition (NER): The task of identifying and extracting named entities such as people, organizations, locations, and dates from unstructured text. This is an important application of NLP, and is used in tasks such as information extraction and text classification.

Conditional generation: The task of generating text, like in GPT, that satisfies a certain condition or requirement, such as generating a story based on a given prompt or question.

GPT-2: A larger and more powerful version of the original GPT model, with 1.5 billion parameters.

GPT-3: A state-of-the-art language model developed by OpenAI with 175 billion parameters, capable of generating highly coherent and contextually appropriate text.

Megatron: A high-performance GPT model developed by NVIDIA, designed to train on large-scale distributed systems.

T5: A transformer-based language model developed by Google, capable of performing a wide range of natural language processing tasks.

CTRL: A conditional transformer language model developed by Salesforce, designed for generating coherent and controllable text.

BingGPT: BingGPT is a proprietary language model developed by Microsoft from the starting point of the GPT architecture developed by OpenAI, which has been incorporated into Microsoft’s search engine.

OpenAI: An “artificial intelligence” research laboratory consisting of a team of scientists and engineers focused on developing cutting-edge AI technologies. The organization was founded in 2015

Prompting MidJourney with emoji

TL:DR – You can paste emoji into a MidJourney prompt and it will generate some interesting art. Some of it seems wholly unrelated to the emoji.


Everyone surely knows by now that you can create some intriguing images by sending an appropriate prompt to a bot, you can even send an image to some of them, such as MidJourney, and have it spit back something that it generates based on that input. Recently, I saw an interesting idea from Linus Ekenstam who was prompting MidJourney with emoji. My next favourite waste of time was obvious…

Prompt: Male singer emoji, the one that looks like Aladdin Sane Bowie

MidJourney art generated from the "singer" emoji
MidJourney art generated from the “male singer” emoji

Prompt: Male and female singers and guitar emoji

MidJourney art generated from the "male singer, female singer, and guitar" emoji
MidJourney art generated from the “male singer, female singer, and guitar” emoji

Prompt: Three guitar emoji

MidJourney art generated from three "guitar" emoji
MidJourney art generated from three “guitar” emoji

Prompt: Singer, guitar, and drum emoji

MidJourney art generated from "microphone, guitar, drum" emoji
MidJourney art generated from “microphone, guitar, drum” emoji

Prompt: Musical keyboard

MidJourney art generated from "musical keyboard" emoji
MidJourney art generated from “musical keyboard” emoji

Prompt: Musical score, flat, sharp, natural emoji

MidJourney art generated from "musical score, flat, sharp, natural" emoji
MidJourney art generated from “musical score, flat, sharp, natural” emoji

UPDATE: I prompted MidJourney with the spider web emoji and it gave me two images that were reminiscent of Spider Man, another of a girl peering through a window at a fire, and this one, which looks like an out-take from Rush album cover designs of the Power Windows era…I added the text, just for fun.

I’ve gathered together more of my MidJourney output on my Pinterest page.

Update: Because of the album-like nature of the last image above, I prompted MJ with a few messages of this sort: “Hipgnosis style, surreal album cover, photorealistic, high-definition, beautiful, incredibly detailed” and it came back with some interesting generative art. The following one used that prompt but with the “woman” emoji at the front-end for good measure.

Repeated the prompt but with the “man” emoji:

I prompted MJ with the emoji representing Diwali/Diya Lamp, if you have coulrophobia, look away now. The other three generative images for that prompt were equally unrelated to Diwali, but this one was the weirdest.

Spawn again – Operation PondLife 2023

UPDATE: 22nd Feb 2024 – Spawn present in the morning, perhaps been there a day or two, it’s been raining heavily so hadn’t checked since the weekend. 40+ frogs in the pond, mating activity still ongoing at the beginning of March, masses of spawn.

I have resculpted the rear edge of the pond to allow it to spill when it is very full and creates a potentially semi-permanent swamp area to increase invertebrate diversity. This was prompted by a chat with a member of our village nature recovery project.

2023: Several frogs active in our garden pond, some of them mating, but no spawn as of 2nd March 2023. UPDATE: 11th March, sudden appearance of spawn and ongoing frog activity, croaking, mating, at least 30 frogs. UPDATE: 11th April. Frog activity down, lots of tadpoles, one batch still very small an older batch looking big-headed and perhaps just about to start sprouting limbs.


I’ve not mentioned our wild garden pond for a while. It’s still thriving with iris, reed, and sedge shoots beginning to show already and it is just late February. Matt often checks on the frogs after dark and reported that there were a pair mating on the night of 26th February, which is a good sign, we might soon be with spawn again. We had spawn on 21st February 2021, but not until 5th March in 2020, nothing in 2019 when I’d only just resurrected the pond.

Garden pond
This is how it looked in May 2019 when I first re-dug our garden pond (it’s just half the size of the original)
garden pond
This is how it looked in April 2022

Just for the record, no fish. This is a wild pond for frogs, water snails, and dragonflies etc. Anything else that turns up by chance will find a home, but I’ve not introduced any creatures other than two species of watersnail from a friend’s wildlife pond.

I asked ChatGPT to act as a herpetologist to explain the lifecycle of the Common Frog:

The Common Frog, Rana temporaria, is a species of amphibian found throughout much of Europe and parts of Asia. Here’s a general overview of its lifecycle:

  1. Egg Stage: The lifecycle of a Common Frog starts when the female lays eggs in water, typically in ponds, lakes or slow-moving streams, usually in the spring. A female can lay up to 4,000 eggs in a single season. The eggs are enclosed in a gelatinous mass that floats on the surface of the water.
  2. Tadpole Stage: After several days, the eggs hatch into tadpoles. Tadpoles are aquatic and breathe through gills. They feed on algae and other small organisms found in the water. During this stage, they develop legs, absorb their tail and undergo metamorphosis.
  3. Metamorphosis Stage: As tadpoles grow, they start to develop legs and eventually lose their tails. They also start to breathe air through their lungs instead of gills, and their digestive system changes to accommodate a more carnivorous diet. This metamorphosis can take several months, depending on the temperature and food availability.
  4. Juvenile Stage: Once the metamorphosis is complete, the young froglets leave the water and begin their life on land. During this stage, they continue to grow and develop, and their diet shifts to insects and other small invertebrates.
  5. Adult Stage: After a few months, the young froglets reach sexual maturity and become adults. They will continue to breed and lay eggs, thus completing the lifecycle.

The entire lifecycle of a Common Frog typically takes around 2-3 years, although it can vary depending on environmental factors such as temperature and food availability.

Avian influenza, bird flu, H5N1

TL:DR – Scientists are worried that we might soon see the emergence of another pandemic, this one originating in avian influenza.


A bird flu pandemic has killed thousands of wild birds over the last couple of years. Scientists have now seen infection in mammals, and very recently a person died from avian influenza and several close contacts show signs of  infection. The concern is that we might be headed for another H5N1 pandemic. Previous strains of H5N1 that infected people had a mortality rate of 60 percent.

Avian influenza, bird flu, H5N1

There are fifteen known variants of avian influenza. The most virulent, and usually fatal in birds, are the H5 and H7 strains. There are then nine variants of the H5 strain and the type of most concern because of the risk to human health is H5N1.

Up to May 2020, the World Health Organisation had reported a total of 861 human cases of bird flu, which killed 455 people since 2003. However, in February 2003, an 11-year-old Cambodian girl died of H5N1, her father tested positive and several close contacts had symptoms. At the time of writing, the WHO believes this strain of H5N1 is spreading among people and is pushing for a new human vaccine against the disease to be produced urgently.

Meet GINA, Global INitiative for Asthma

TL:DR – Thanks to GINA I have not needed to use salbutamol to treat my asthma for three years at the time of writing.


It’s three years since my asthma nurse introduced me to GINA, the Global INitiative for Asthma. It was a phone consultation because the then new virus SARS-CoV-2, which causes what became known as COVID-19, was beginning to spread. At the time, I was very worried that it would be a killer for me given my asthma. Eventually, medical science learned that people with asthma were not necessarily at any greater risk of morbidity and mortality.

Anyway, it was nice to meet GINA. GINA contradicts some of what general practitioners (GPs) in the UK have been advising asthma patients for years, but supports what the medical evidence suggests one should do in terms of medication. The main thing is GINA told me to stop using Salbutamol (the reliever spray) and to use my Symbicort (combined preventer with reliever spray) instead. Salbutamol is an old, out-of-date drug with some unpleasant side effects in long-term use, partly because it can affect the heart but mainly because inappropriate use in the absence of proper control of the condition with a preventer can lead to scarring in the lungs.

Modern relievers are more targeted to the lungs rather than other organs. Moreover, using the preventer at a higher dose, which is what my nurse recommended, means you reduce inflammation and mucus production, which are the main issues with asthma in parallel with episodic bronchoconstriction.

For three years, I have been taking twice as much preventer each day as I was before, and only very occasionally need to take extra to get a dose of reliever. I have not used Salbutamol at all in those three years and don’t expect to have to ever again. I no longer carry Salbutamol with me.

A GP friend was concerned when I told him about GINA, he reasoned, quite sensibly that taking this approach would be increasing one’s exposure to the preventer drugs, which are corticosteroids. This is true. But asthma is mostly about inflammation and these drugs reduce inflammation and so should reduce asthma symptoms. They do. He discussed the issue with colleagues and is now persuaded that GINA is a more sensible approach to asthma control than the current NICE recommendations in the UK. Hopefully, someone will introduce NICE to GINA soon.

Birding and wording

TL:DR – An answer to the question why are so many science writers also birders?


Someone on social media asked me as a science writer why so many science writers are also birders. My initial thoughts were as follows: Well, to be a science writer, I think you have to be curious, analytical, interested in lots of stuff…a polymath. Birds are interesting, identification requires analysis, writing about them is fun and if you’re a togger [birder-photographer], you always have a decent photo to illustrate your article.

Wind-blown Short-eared Owl perched on a fence post
Short-eared Owl

One possible explanation for the prevalence of birding among science writers is the concept of “flow.” Flow is a state of deep engagement and enjoyment that occurs when a person is fully absorbed in an activity that is challenging but also within their skill level. Birding can be a highly immersive activity that requires focus, attention to detail, and knowledge of ecology and behaviour. Science writing also demands a similar level of engagement and attention to detail. Thus, individuals who enjoy birding may be drawn to science writing because it provides a similar experience of flow. [In my case, I’d say it was the reverse, I was drawn to birding, because I had a scientific mind and was a science writer and wanted an additional creative outlet].

A flock of 80 or so Avocet
A flock of 80 or so Avocet

Another possible explanation is that birding can serve as a form of “nature therapy” that provides mental health benefits. Research has shown that spending time in nature can reduce stress, improve mood, and increase creativity. Science writing can be a demanding and intellectually stimulating profession, and birding may provide a healthy balance by allowing individuals to connect with nature and recharge their mental batteries.

Red Kite flying against a blue sky
Red Kite

Finally, it’s possible that there is simply a correlation between the personality traits that make someone a good science writer and those that make someone interested in birding. For example, curiosity, attention to detail, and a broad range of interests are all traits that are valued in both professions. Additionally, both birding and science writing are activities that can be pursued alone or in groups, providing opportunities for social interaction and a sense of belonging. [I definitely think of myself as a wannabe polymath, hence my “songs, snaps, science” motif].

Overall, while there is no definitive answer to why so many science writers are also birders, it’s likely that a combination of these factors plays a role. Regardless of the reason, it’s clear that the intersection of birding and science writing provides a rich and fascinating field of exploration.

How to pronounce Kyiv correctly

TL:DR – The correct pronunciation of Kyiv.


The capital city of Ukraine, written in English, is Kyiv. But there still seems to be so much inconsistency in how English-speaking newsreaders and reporters pronounce the place name.

It’s definitely not Keeeev and definitely, definitely not Kee-eff or Kee-ev.

Watch and listen…and learn…

That’s it. No ifs, no buts.

K(r)-yee-iv – but rolled tightly together into two short syllables. That bracketed r is like a Northumbrian or French “r” sound, but much shorter.

That said, English speakers do not pronounce Paris like the French, so perhaps they could be consistent with an Anglicised version of Kyiv – Kyeev.

To pronounce “Kyiv” correctly, follow these steps:

Start with the “K” sound, which is a voiceless velar plosive. This is the same sound as in the English word “kite” or “cat”. But, the K sound is closely associated with a soft “r” sound, which is pronounced by touching the tip of the tongue to the roof of the mouth, just behind the front teeth, and making a soft hissing sound (like a shorter version of the “r” in Paris spoken in French).

Next, move on to the “yi” sound, which is a diphthong in Ukrainian. It is pronounced as a combination of the vowel sounds “i” and “y”. The sound is similar to the “ee” sound in the English word “beet” but with a slightly rounded lip shape.

Finally, end with the “v” sound, which is a voiced labiodental fricative. This sound is similar to the “v” sound in the English word “vase” or “vine”.
So the correct pronunciation of Kyiv is “K(r)eev” with the emphasis on the first syllable.