Introducing PHP
Conditions and Branches
Loops
A Working Example
Arrays
Strings
Regular Expressions
Date and Time Functions
Integer and Float Functions
User-Defined Functions
Objects
Common Mistakes
In this chapter, we introduce the PHP scripting language. PHP is similar to high-level languages such as C, Perl, Pascal, FORTRAN, and Java, and programmers who have experience with any of these languages should have little trouble learning PHP. This chapter serves as an introduction to PHP; it's not a programming guide. We assume you are already familiar with programming in a high-level language.
The topics covered in this chapter include:
PHP basics, including script structure, variables, supported types, constants, expressions, and type conversions
Condition and branch statements supported by PHP, including if, if...else, and the switch statements
Looping statements
Arrays and array library functions
Strings and string library functions
Regular expressions
Date and time functions
Integer and float functions
How to write functions, reuse components, and determine the scope and type of variables
An introduction to PHP object-oriented programming support
Common mistakes made by programmers new to PHP, and how to solve them
Programmers new to PHP should read Section 2.1, which describes the basic structure of a PHP script and its relationship to HTML, and includes discussion of how PHP handles variables and types. The two sections that follow, Section 2.2 and Section 2.3, deal with conditional statements and looping structures and should be familiar material. We then present a short example that puts many of the basic PHP concepts together.
The remainder of the chapter expands on the more advanced features of PHP, presents a reference to selected library functions, and discusses some of the common mistakes that programmers make when learning PHP. This material can be examined briefly, and used later as a reference while reading Chapter 4 to 13 and while programming in PHP. However, programmers new to PHP should consider reading the beginning of the Section 2.5 and Section 2.6 sections to understand the way PHP supports these concepts, as there are important differences from other languages.
We don't attempt to cover every function and every library that are supported by PHP. However, we provide brief descriptions of the supported libraries in Appendix E. In later chapters, we discuss more specialized library functions that support the topics and techniques presented here.
The current version of PHP is PHP4, which we call PHP throughout this book. The current release at the time of writing is 4.0.6.
PHP is a recursive acronym that stands for PHP: Hypertext Preprocessor; this is in the naming style of GNU, which stands for GNU's Not Unix and which began this odd trend. The name isn't a particularly good description of what PHP is and what it's commonly used for. PHP is a scripting language that's usually embedded or combined with HTML and has many excellent libraries that provide fast, customized access to DBMSs. It's an ideal tool for developing application logic in the middle tier of a three-tier application.
Example 2-1 shows the first PHP script in this book, the ubiquitous "Hello, world." When requested by a web browser, the script is run on the web server and the resulting HTML document sent back to the browser and rendered as shown in Figure 2-1.
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd" > <html> <head> <title>Hello, world</title> </head> <body bgcolor="#ffffff"> <h1> <?php echo "Hello, world"; ?> </h1> </body> </html>
Example 2-1 illustrates the basic features of a PHP script. It's a mixture of HTML—in this case it's mostly HTML—and a PHP script. The script in this example:
<?php echo "Hello, world"; ?>
simply prints the greeting, "Hello, world."
The PHP script shown in Example 2-1 is rather pointless: we could simply have authored the HTML to include the greeting directly. Because PHP integrates so well with HTML, using PHP to produce static strings is far less complicated and less interesting than using other high-level languages. However, the example does illustrate several features of PHP:
The begin and end script tags are <?php and ?> or, more simply, just <? and ?>. The longer begin tag style <?php avoids conflicts with other processing instructions that can be used in HTML. We use both styles in this book.
Other begin and end tag styles can also be configured, such as the HTML style that is used with JavaScript or other embedded scripts: <script language="PHP"> and </script>.
Whitespace has no effect, except to aid readability for the developer. For example, the script could have been written succinctly as <?php echo "Hello, world";?> with the same effect. Any mix of spaces, tabs, carriage returns, and so on in separating statements is allowed.
A PHP script is a series of statements, each terminated with a semicolon. Our simple example has only one statement: echo "Hello, world";.
A PHP script can be anywhere in a file and interleaved with any HTML fragment. While Example 2-1 contains only one script, there can be any number of PHP scripts in a file.
When a PHP script is run, the entire script including the start and end script tags <?php and ?> is replaced with the output of the script.
TIP: When we present a few lines of code that are sections of larger scripts, we usually omit the start and end tags.
The freedom to interleave any number of scripts with HTML is one of the most powerful features of PHP. A short example is shown in Example 2-2; a variable, $outputString="Hello, world", is initialized before the start of the HTML document, and later this string variable is output twice, as part of the <title> and <body> elements. We discuss more about variables and how to use them later in this chapter.
<?php $outputString = "Hello, world"; ?> <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd" > <html> <head> <title><?php echo $outputString; ?></title> </head> <body bgcolor="#ffffff"> <h1><?php echo $outputString; ?></h1> </body> </html>
The flexibility to add multiple scripts to HTML can also lead to unwieldy, hard-to-maintain code. Care should be taken in modularizing code and HTML; we discuss how to separate code and HTML using templates in Chapter 13.
A PHP script can be written using plain text[4] and can be created with any text editor, such as joe, vi, nedit, emacs, or pico.
[4]While printable characters with the most significant bit are allowed, PHP scripts are usually written using characters from the 7-bit ASCII character set.
If you save a PHP script in a file with a .php extension under the directory configured as Apache's document root, Apache executes the script when a request is made for the resource. Following the installation instructions given in Appendix A, the document root is:
/usr/local/apache/htdocs/
Consider what happens when the script shown in Example 2-1 is saved in the file:
/usr/local/apache/htdocs/example.2-1.php
Apache—when configured with the PHP module—executes the script when requests to the URL http://localhost/example.2-1.php are made, assuming the web browser is running on the same machine as the web server.
If directory permissions don't permit creation of files in the document root, it's also possible to work in the user home directories. If the installation instructions in Appendix A have been followed, a directory can be created by a user beneath her home directory and the permissions set so that the directory is readable by the web server:
mkdir ~/public_html chmod a+rx ~/public_html
The example file can then be created with the filename:
~/public_html/example.2-1.php
The file can then be retrieved with the URL http://localhost/~user/example.2-1.php, where user is the user login name.
Comments can be included in code using familiar styles from other high-level programming languages. This includes the following styles:
// This is a one-line comment # This is another one-line comment style /* This is how you can create a multi-line comment */
The echo statement used in Example 2-1 and Example 2-2 is frequently used and designed to output any type of data. The print statement can be used for the same purpose. Consider some examples:
echo "Hello, world"; // print works just the same print "Hello, world"; // numbers can be printed too echo 123; // So can the contents of variables echo $outputString;
The difference between print and echo is that echo can output more than one argument:
echo "Hello, ", "world";
There is also a shortcut that can output data. The following very short script outputs the value of the variable $temp:
<?=$temp; ?>
The print and echo statements are also often seen with parentheses:
echo "hello"; // is the same as echo ("hello");
Parentheses make no difference to the behavior of print. However, when they are used with echo, only one output parameter can be provided.
The echo and print statements can be used for most tasks and can output any combination of static strings, numbers, arrays, and other variable types discussed later in this chapter. We discuss more complex output with printf in Section 2.6 later in this section.
PHP can create double- and single-quoted string literals. If double quotation marks are needed as part of a string, the easiest approach is to switch to the single-quotation style:
echo 'This works'; echo "just like this."; // And here are some strings that contain quotes echo "This string has a ': a single quote!"; echo 'This string has a ": a double quote!';
Quotation marks can be escaped like this:
echo "This string has a \": a double quote!"; echo 'This string has a \': a single quote!';
One of the convenient features of PHP is the ability to include the value of a variable in a string literal. PHP parses double-quoted strings and replaces variable names with the variable's value. The following example shows how:
$number = 45; $vehicle = "bus"; $message = "This $vehicle holds $number people"; // prints "This bus holds 45 people" echo $message;
To include backslashes and dollar signs in a double-quoted string, the escaped sequences \\ and \$ can be used. The single-quoted string isn't parsed in the same way as a double-quoted string and can print strings such as:
'a string with a \ and a $'
We discuss parsing of string literals in more detail in Section 2.6.
Variables in PHP are identified by a dollar sign followed by the variable name. Variables don't need to be declared, and they have no type until they are assigned a value. The following code fragment shows a variable $var assigned the value of an expression, the integer 15. Therefore, $var is defined as being of type integer.
$var = 15;
Because the variable in this example is used by assigning a value to it, it's implicitly declared. Variables in PHP are simple: when they are used, the type is implicitly defined—or redefined—and the variable implicitly declared.
The variable type can change over the lifetime of the variable. Consider an example:
$var = 15; $var = "Sarah the Cat";
This fragment is acceptable in PHP. The type of $var changes from integer to string as the variable is reassigned. Letting PHP change the type of a variable as the context changes is very flexible and a little dangerous.
Variable names are case-sensitive in PHP, so $Variable, $variable, $VAriable, and $VARIABLE are all different variables.
WARNING: One of the most common sources of bugs in PHP is failing to detect that more than one variable has accidentally been created. The flexibility of PHP is a great feature but is also dangerous. We discuss later how to set the error reporting of PHP so that it creates warning messages sensitive to unassigned variables being used.
PHP has four scalar types—boolean, float, integer, and string—and two compound types, array and object.
TIP: In this book, and particularly in this chapter, we present function prototypes that specify the types of arguments and return values. There are many functions that allow arguments or return values to be of different types, which we describe as mixed.
Variables of a scalar type can contain a single value at any given time. Variables of a compound type—array or object—are made up of multiple scalar values or other compound values. Arrays and objects have their own sections later in this chapter. Other aspects of variables—including global variables and scope—are discussed later, with user-defined functions.
Boolean variables are as simple as they get: they can be assigned either true or false. Here are two example assignments of a Boolean variable:
$variable = false; $test = true;
An integer is a whole number, while a float is a number that has an exponent and a fractional part. The number 123.01 is a float, and so is 123.0. The number 123 is an integer. Consider the following two examples:
// This is an integer $var1 = 6; // This is a float $var2 = 6.0;
A float can also be represented using an exponential notation:
// This is a float that equals 1120 $var3 = 1.12e3; // This is also a float that equals 0.02 $var4 = 2e-2
You've already seen examples of strings earlier, when echo( ) and print( ) were introduced, and string literals are covered further in Section 2.6. Consider two example string variables:
$variable = "This is a string"; $test = 'This is also a string';
Constants associate a name with a simple, scalar value. For example, the Boolean values true and false are constants associated with the values 1 and 0, respectively. It's also common to declare constants in a script. Consider this example constant declaration:
define("pi", 3.14159); // This outputs 3.14159 echo pi;
Constants aren't preceded by a $ character; they can't be changed once they have been defined; they can be accessed anywhere in a script, regardless of where they are declared; and they can only be simple, scalar values.
Constants are useful because they allow parameters internal to the script to be grouped. When one parameter changes—for example, if you define a new maximum number of lines per web page—you can alter this constant parameter in only one place and not throughout the code.
We've already described simple examples of assignment, in which a variable is assigned the value of an expression using an equals sign. Most numeric assignments and expressions that work in other high-level languages also work in PHP. Here are some examples:
// Assign a value to a variable $var = 1; // Sum integers to produce an integer $var = 4 + 7; // Subtraction, multiplication, and division // that might have a result that is a float or // an integer, depending on the initial value of $var $var = (($var - 5) * 2) / 3; // These all add 1 to $var $var = $var + 1; $var += 1; $var++; // And these all subtract 1 from $var $var = $var - 1; $var -= 1; $var--; // Double a value $var = $var * 2; $var *= 2; // Halve a value $var = $var / 2; $var /= 2; // These work with float types too $var = 123.45 * 28.2;
There are many mathematical functions available in the math library of PHP for more complex tasks. We introduce some of these in Section 2.9.
String assignments and expressions are similar:
// Assign a string value to a variable $var = "test string"; // Concatenate two strings together // to produce "test string" $var = "test" . " string"; // Add a string to the end of another // to produce "test string" $var = "test"; $var = $var . " string"; // Here is a shortcut to add a string to // the end of another $var .= " test";
Expressions in PHP are formulated in much the same way as other languages. An expression is formed from literal values (integers, strings, floats, Booleans, arrays, and objects), operators, and function calls that return values. An expression has a value and a type; for example, the expression 4 + 7 has the value 11 and the type integer, and the expression "Kelpie" has the value Kelpie and the type string. PHP automatically converts types when combining values in an expression. For example, the expression 4 + 7.0 contains an integer and a float; in this case, PHP considers the integer as a floating-point number, and the result is a float. The type conversions are largely straightforward; however, there are some traps, which are discussed later in this section.
The precedence of operators in an expression is similar to the precedence defined in any other language. Multiplication and division occur before subtraction and addition, and so on. However, reliance on evaluation order leads to unreadable, confusing code. Rather than memorize the rules, we recommend you construct unambiguous expressions with parentheses, because parentheses have the highest precedence in evaluation.
For example, in the following fragment $variable is assigned a value of 32 because of the precedence of multiplication over addition:
$variable = 2 + 5 * 6;
The result is much clearer if parentheses are used:
$variable = 2 + (5 * 6);
PHP provides several mechanisms to allow variables of one type to be considered as another type. Variables can be explicitly converted to another type with the following functions:
string strval(mixed variable) integer intval(mixed variable) float floatval(mixed variable)
The function settype(mixed variable, string type) can explicitly set the type of variable to type, where type is again one of array, boolean, float, integer, object, or string.
PHP supports type-casting in much the same way as C, to allow the type of an expression to be changed. By placing the type name in parentheses in front of a variable, PHP converts the value to the desired type:
(int) $var or (integer) $var |
Cast to integer |
(bool) $var or (boolean) $var |
Cast to Boolean |
(float) $var, (double) $var or (real) $var |
Cast to float |
(string) $var |
Cast to string |
(array) $var |
Cast to array |
(object) $var |
Cast to object |
The rules for converting types are mostly common sense, but some conversions may not appear so straightforward. Table 2-1 shows how various values of $var are converted using the (int), (bool), (string), and (float) casting operators.
Value of $var |
(int) $var |
(bool) $var |
(string) $var |
(float) $var |
---|---|---|---|---|
null |
0 |
false |
"" |
0 |
true |
1 |
true |
"1" |
1 |
false |
0 |
false |
"" |
0 |
0 |
0 |
false |
"0" |
0 |
3.8 |
3 |
true |
"3.8" |
3.8 |
"0" |
0 |
false |
"0" |
0 |
"10" |
10 |
true |
"10" |
10 |
"6 feet" |
6 |
true |
"6 feet" |
6 |
"foo" |
0 |
true |
"foo" |
0 |
Automatic type conversion occurs when two differently typed variables are combined in an expression or when a variable is passed as an argument to a library function that expects a different type. When a variable of one type is used as if it were another type, PHP automatically converts the variable to a value of the required type. The same rules are used for automatic type conversion as are demonstrated in Table 2-1.
Some simple examples show what happens when strings are added to integers and floats and when strings and integers are concatenated:
// $var is set as an integer = 115 $var = "100" + 15; // $var is set as a float = 115.0 $var = "100" + 15.0; // $var is set as a string = "39 Steps" $var = 39 . " Steps";
Not all type conversions are so obvious and can be the cause of hard-to-find bugs:
// $var is set as an integer = 39 $var = 39 + " Steps"; // $var is an integer = 42 $var = 40 + "2 blind mice"; // $var is a float, but what does it mean $var = "test" * 4 + 3.14159;
Automatic type conversion can change the type of a variable. Consider the following example:
$var = "1"; // $var is a string == "1" $var += 2; // $var is now an integer == 3 $var /= 2; // $var is now a float == 1.5 $var *= 2; // $var is still a float == 3
WARNING: Care must be taken when interpreting non-Boolean values as Boolean. Many library functions in PHP return values of different types: false if a valid result could not be determined, or a valid result. A valid return value of 0, 0.0, "0", an empty string, null, or an empty array is interpreted false when used as a Boolean value.The solution is to test the type of the variable using the functions described in the next section.
Because PHP is flexible with types, it provides the following functions that can check a variable's type:
boolean is_int(mixed variable) boolean is_float(mixed variable) boolean is_bool(mixed variable) boolean is_string(mixed variable) boolean is_array(mixed variable) boolean is_object(mixed variable)
All the functions return a Boolean value of true or false for the variable variable, depending on whether it matches the variable type that forms the name of the function. For example, the following prints 1, that is, true:
$test = 13.0; echo is_float($test); // prints 1 for true
PHP provides the print_r( ) and var_dump( ) functions, which print the type and value of an expression in a human-readable form:
print_r(mixed expression) var_dump(mixed expression [, mixed expression ...])
These functions are useful for debugging a script, especially when dealing with arrays or objects. To test the value and type of $variable at some point in the script, the following code can be used:
$variable = 15; var_dump($variable);
This prints:
int(15)
While the var_dump( ) function allows multiple variables to be tested in one call, and provides information about the size of the variable contents, print_r( ) provides a more concise representation of arrays and objects. These functions can be used on variables of any type, and we use them throughout this chapter to help illustrate the results of our examples.
During the running of a PHP script, a variable may be in an unset state or may not yet be defined. PHP provides the isset( ) function and the empty( ) language construct to test the state of variables:
boolean isset(mixed var) boolean empty(mixed var)
isset( ) tests if a variable has been set with a non-null value, while empty( ) tests if a variable has a value. The two are different, as shown by the following code:
$var = "test"; // prints: "Variable is Set" if (isset($var)) echo "Variable is Set"; // does not print if (empty($var)) echo "Variable is Empty";
A variable can be explicitly destroyed using unset( ):
unset(mixed var [, mixed var [, ...]])
After the call to unset in the following example, $var is no longer defined:
$var = "foo"; // Later in the script unset($var); // Does not print if (isset($var)) echo "Variable is Set";
Another way to test that a variable is empty is to force it to the Boolean type using the (bool) cast operator discussed earlier. The example interprets the $var variable as type Boolean, which is equivalent to testing for !empty($var):
$var = "foo"; // Both lines are printed if ((bool)$var) echo "Variable is not Empty"; if (!empty($var)) echo "Variable is not Empty";
Table 2-2 show the return values for isset($var), empty($var), and (bool)$var when the variable $var is tested. Some of the results may be unexpected: when $var is set to "0", empty( ) returns true.
Copyright © 2003 O'Reilly & Associates. All rights reserved.