Avoid Using .find() to Check for Substrings

Check if a Python String Contains a Substring Martin Breuss 05:20

00:00 In the previous lesson, I showed you that you can use text_lower.index() to get the index position of the first occurrence of the substring.

00:10 This is where it starts. And there is a second string method called .find() that does basically the same. So if you use .find() and then pass it "secret", you will get the same result, 59.
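As a minimal sketch of that comparison: the text used in the lesson isn't reproduced in this transcript, so the value of text_lower below is a hypothetical stand-in, and the resulting index is 18 rather than 59.

```python
# Hypothetical stand-in for the lesson's text_lower; not the original text.
text_lower = "this is where the secret is hidden"

# Both methods return the index where the first occurrence starts.
index_result = text_lower.index("secret")
find_result = text_lower.find("secret")

print(index_result)  # 18
print(find_result)   # 18
```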

00:25 This is where the first occurrence of the substring starts. But this has a name that is sometimes a little misleading for Python programmers because .find() seems like a good way to find a string in Python, right? And I would argue that it is quite descriptively named.

00:42 You want to find where the string is, and if you think of it like that, then .find() also makes sense. It gives you the index position, right?

00:50 I prefer to use .index() because I feel it’s a little more descriptive of what both of these methods actually return, which is an index position. Now, there’s a difference between them, which is how they behave when they do not find the substring.

01:06 So if I say text_lower.index() and pass it a substring that’s not in there—let’s say "treasure"—

01:15 then .index() gives a ValueError and just tells me that the substring isn’t found. So, in my opinion, it’s quite descriptive and understandable. If I do the same with .find(),

01:29 it does not throw an error but instead it returns the value—no, well that was the wrong string.

01:38 Instead it returns the value -1. In the context of .find(), this -1 means the substring is not in the string, but it’s not really very descriptive in my opinion.
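The difference in failure behavior can be sketched like this, again assuming a hypothetical text_lower that doesn't contain "treasure":

```python
# Hypothetical stand-in for the lesson's text_lower; not the original text.
text_lower = "this is where the secret is hidden"

# .index() raises a ValueError when the substring is missing.
index_outcome = None
try:
    text_lower.index("treasure")
except ValueError as error:
    index_outcome = f"ValueError: {error}"

# .find() signals the same situation by returning -1 instead.
find_outcome = text_lower.find("treasure")

print(index_outcome)
print(find_outcome)  # -1
```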

01:52 I’d rather have a ValueError raised that tells me this is not in there than get -1. The reason is that you’ll see people discover the .find() string method and use it to check for a substring in a string. Then they start writing a conditional where they say if text_lower.find(), let’s say "treasure",

02:18 != -1: then print("found it"), right, and else maybe print something like

03:04 It works because we know that .find() returns -1 if it’s not found, we know "treasure" is not in that string, so this whole expression is going to evaluate to False, and because of that, you’re going to print out that the substring is not in the string.
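Written out, the pattern being described looks roughly like the sketch below. The text_lower value is a hypothetical stand-in, and the message in the else branch is a placeholder, since the exact wording isn't captured in the transcript.

```python
# Hypothetical stand-in for the lesson's text_lower; not the original text.
text_lower = "this is where the secret is hidden"

# The comparison against -1 is what makes this harder to read at a glance.
if text_lower.find("treasure") != -1:
    message = "found it"
else:
    message = "not found"  # placeholder wording, not from the lesson

print(message)  # not found
```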

03:20 But if you think back to how we did that initially, which is saying "treasure" in text_lower, this is much shorter and more concise, and also the output makes a lot of sense.

03:35 So in this case you’re just saying is it in there, and Python returns False, which means you can write your conditional statement like so. You can say if—let me just copy that—if "treasure" in text_lower: print("found it")

03:53 and else

04:29 Well, you shouldn’t be using that, and instead just stick with using the in operator, which is the most readable way of checking for a substring in a string.
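For comparison, here is a sketch of the same check with the in operator. As before, text_lower is a hypothetical stand-in and the else message is a placeholder.

```python
# Hypothetical stand-in for the lesson's text_lower; not the original text.
text_lower = "this is where the secret is hidden"

# The membership test evaluates directly to True or False.
is_present = "treasure" in text_lower
print(is_present)  # False

if "treasure" in text_lower:
    message = "found it"
else:
    message = "not found"  # placeholder wording, not from the lesson

print(message)  # not found
```

There is no sentinel value to remember here: the condition reads the way you would say it out loud.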

05:12 All of these string methods are useful if you want to learn more about the substring, and that’s what you should use them for.
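One way to combine the two ideas, sketched under the assumption of a hypothetical text_lower: check for membership with in first, then call string methods to learn more. The use of .count() here is an illustrative choice rather than something shown in this lesson.

```python
# Hypothetical stand-in for the lesson's text_lower; not the original text.
text_lower = "this is where the secret is hidden"

position = None
occurrences = 0

# Confirm the substring exists, then ask follow-up questions about it.
if "secret" in text_lower:
    position = text_lower.index("secret")     # where the first occurrence starts
    occurrences = text_lower.count("secret")  # how many non-overlapping matches

print(position)     # 18
print(occurrences)  # 1
```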
