diff --git a/01-introduction.md b/01-introduction.md
index e64c424c..c29d0c18 100644
--- a/01-introduction.md
+++ b/01-introduction.md
@@ -53,7 +53,7 @@ into a field that should contain a number. Understanding the nature of relationa
databases, and using SQL, will help you in using databases in programming languages
such as R or Python.
-Many web applications (including WordPress and ecommerce sites like Amazon) run on a SQL (relational) database. Understanding SQL is the first step in eventually building custom web applications that can serve data to users.
+Many web applications (including WordPress and e-commerce sites like Amazon) run on a SQL (relational) database. Understanding SQL is the first step in eventually building custom web applications that can serve data to users.
## Why are people working in library- and information-related roles well suited to SQL?
@@ -77,7 +77,7 @@ direct way of finding information.
- You can use SQL to query your library database and explore new views that are not necessarily provided via library systems patron facing interfaces.
-- SQL can be used to keep an inventory of items, for instance, for a library's makerspace, or it can be used to track licenses for journals.
+- SQL can be used to keep an inventory of items, for instance, for a library's makerspace, or it can be used to track licences for journals.
- For projects involving migrating and cleaning data from one system to another, SQL can be a handy tool.
@@ -108,34 +108,35 @@ Let's all open the database we downloaded via the setup in DB Browser for SQLite
You can see the tables in the database by looking at the left hand side of the
screen under Tables.
-To see the contents of a table, click on that table and then click on the Browse
-Data tab above the table data.
+To see the contents of a table, click on "Browse Data" then select the table in the "Table" dropdown in the upper left corner.
-If we want to write a query, we click on the Execute SQL tab.
+If we want to write a query, we click on the "Execute SQL" tab.
There are two ways to add new data to a table without writing SQL:
1. Enter data into a CSV file and append
2. Click the "Browse Data" tab, then click the "New Record" button.
-The steps for adding data from a CSV file are:
+To add data from a CSV file:
-1. Choose "File" > "Import" > "Table" from CSV file...
-2. DB Browser for SQLite will prompt you if you want to add the data to the existing table.
+1. Choose "File" > "Import" > "Table from CSV file..."
+2. Select a CSV file to import
+3. Review the import settings and confirm that the column names and fields are correct
+4. Click "OK" to import the data. If the table name matches an existing table and the number of columns match, DB Browser will ask if you want to add the data to the existing table.
## Dataset Description
-The data we will be using consists of 5 csv files that contain tables of article titles, journals, languages, licenses, and publishers. The information in these tables are from a sample of 51 different journals published during 2015.
+The data we will use was created from 5 csv files that contain tables of article titles, journals, languages, licences, and publishers. The information in these tables are from a sample of 51 different journals published during 2015.
**articles**
-- Contains individual article Titles and the associated citations and metadata
+- Contains individual article titles and the associated citations and metadata.
- (16 fields, 1001 records)
-- Field names: `id`, `Title`, `Authors`, `DOI`, `URL`, `Subjects`, `ISSNs`, `Citation`, `LanguageID`, `LicenseID`, `Author_Count`, `First_Author`, `Citation_Count`, `Day`, `Month`, `Year`
+- Field names: `id`, `Title`, `Authors`, `DOI`, `URL`, `Subjects`, `ISSNs`, `Citation`, `LanguageID`, `LicenceID`, `Author_Count`, `First_Author`, `Citation_Count`, `Day`, `Month`, `Year`
**journals**
-- Contains various journal Titles and associated metadata. The table also associates Journal Titles with ISSN numbers that are then referenced in the 'articles' table by the `ISSNs` field.
+- Contains various journal titles and associated metadata. The table also associates Journal Titles with ISSN numbers that are then referenced in the 'articles' table by the `ISSNs` field.
- (5 fields, 51 records)
- Field names: `id`, `ISSN-L`,`ISSNs`, `PublisherID`, `Journal_Title`
@@ -145,9 +146,9 @@ The data we will be using consists of 5 csv files that contain tables of article
- (2 fields, 4 records)
- Field names: `id`, `Language`
-**licenses**
+**licences**
-- ID table which associates License codes with id numbers. These id numbers are then referenced in the 'articles' table by the `LicenseID` field.
+- ID table which associates Licence codes with id numbers. These id numbers are then referenced in the 'articles' table by the `LicenceID` field.
- (2 fields, 4 records)
- Field names: `id`, `Licence`
@@ -163,14 +164,14 @@ The main data types that are used in doaj-article-sample database are `INTEGER`
## SQL Data Type Quick Reference
-Different database software/platforms have different names and sometimes different definitions of data types, so you'll need to understand the data types for any platform you are using. The following table explains some of the common data types and how they are represented in SQLite; [more details available on the SQLite website](https://www.sqlite.org/datatype3.html).
+Different database software/platforms have different names and sometimes different definitions of data types, so you'll need to understand the data types for any platform you are using. The following table explains some of the common data types and how they are represented in SQLite; [more details available on the SQLite website](https://www.sqlite.org/datatype3.html).
| Data type | Details | Name in SQLite |
-| :--------------------- | :------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ | :-------------------------------------------------------------------------------------------------------------------- |
+| :--------------------- |:--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| :-------------------------------------------------------------------------------------------------------------------- |
| boolean or binary | this variable type is often used to represent variables that can only have two values: yes or no, true or false. | doesn't exist - need to use integer data type and values of 0 or 1. |
-| integer | sometimes called whole numbers or counting numbers. Can be 1,2,3, etc., as well as 0 and negative whole numbers: -1,-2,-3, etc. | INTEGER |
+| integer | sometimes called whole numbers or counting numbers. Can be 1, 2, 3, etc., as well as 0 and negative whole numbers: -1, -2, -3, etc. | INTEGER |
| float, real, or double | a decimal number or a floating point value. The largest possible size of the number may be specified. | REAL |
-| text or string | and combination of numbers, letters, symbols. Platforms may have different data types: one for variables with a set number of characters - e.g., a zip code or postal code, and one for variables with an open number of characters, e.g., an address or description variable. | TEXT |
+| text or string | any combination of numbers, letters, symbols. Platforms may have different data types: one for variables with a set number of characters - e.g., a zip code or postal code, and one for variables with an open number of characters, e.g., an address or description variable. | TEXT |
| date or datetime | depending on the platform, may represent the date and time or the number of days since a specified date. This field often has a specified format, e.g., YYYY-MM-DD | doesn't exist - need to use built-in date and time functions and store dates in real, integer, or text formats. See [Section 2.2 of SQLite documentation](https://www.sqlite.org/datatype3.html#date_and_time_datatype) for more details. |
| blob | a Binary Large OBject can store a large amount of data, documents, audio or video files. | BLOB |
diff --git a/02-selecting-sorting-data.md b/02-selecting-sorting-data.md
index bb316357..74ff7556 100644
--- a/02-selecting-sorting-data.md
+++ b/02-selecting-sorting-data.md
@@ -22,11 +22,11 @@ exercises: 5
## What is a query?
-A query is a question or request for data. For example, "How many journals does our library subscribe to?" When we query a database, we can ask the same question using a common language called Structured Query Language or SQL in what is called a statement. Some of the most useful queries - the ones we are introducing in this first section - are used to return results from a table that match specific criteria.
+A query is a question or request for data. For example, "How many journals does our library subscribe to?". When we query a database, we can ask the same question using Structured Query Language (SQL) in what is called a statement. Some of the most useful queries - the ones we are introducing in this first section - are used to return results from a table that match specific criteria.
## Writing my first query
-Let's start by opening DB Browser for SQLite and the doaj-article-sample database (see Setup). Choose `Browse Data` and the `articles` table. The articles table contains columns or fields such as `Title`, `Authors`, `DOI`, `URL`, etc.
+Let's start by opening DB Browser for SQLite and the doaj-article-sample database (see [Setup](/)). Click "Browse Data" and select the `articles` table in the "Table" dropdown menu. The articles table contains columns or fields such as `Title`, `Authors`, `DOI`, `URL`, etc.
Let's write a SQL query that selects only the `Title` column from the `articles` table.
@@ -60,7 +60,7 @@ SELECT Title, Authors, ISSNs, Year, DOI
FROM articles;
```
-Or we can select all of the columns in a table using the wildcard `*`.
+Or we can select all of the columns in a table using the wildcard `*`:
```sql
SELECT *
diff --git a/03-filtering.md b/03-filtering.md
index 1cba9a17..02930b74 100644
--- a/03-filtering.md
+++ b/03-filtering.md
@@ -20,12 +20,12 @@ exercises: 10
## Filtering
-SQL is a powerful tool for filtering data in databases based on a set of conditions. Let's say we only want data for a specific ISSN, for instance, for the *Acta Crystallographica* journal from the `articles` table. The journal has an ISSN code `2056-9890`. To filter by this ISSN code, we will use the `WHERE` clause.
+SQL is a powerful tool for filtering data in databases based on a set of conditions. Let's say we only want data for a specific ISSN, for instance, for the *Acta Crystallographica* journal from the `articles` table. The journal has an ISSN code `2056-9890`. To filter by this ISSN code, we will use the `WHERE` clause.
```sql
SELECT *
FROM articles
-WHERE ISSNs='2056-9890';
+WHERE ISSNs = '2056-9890';
```
We can add additional conditions by using `AND`, `OR`, and/or `NOT`. For example, suppose we want the data on *Acta Crystallographica* published after October:
@@ -33,7 +33,7 @@ We can add additional conditions by using `AND`, `OR`, and/or `NOT`. For example
```sql
SELECT *
FROM articles
-WHERE (ISSNs='2056-9890') AND (Month > 10);
+WHERE (ISSNs = '2056-9890') AND (Month > 10);
```
Parentheses are used merely for readability in this case but can be required by the SQL interpreter in order to disambiguate formulas.
@@ -44,7 +44,7 @@ ISSNs codes "2076-0787" and "2077-1444", we can combine the tests using OR:
```sql
SELECT *
FROM articles
-WHERE (issns = '2076-0787') OR (issns = '2077-1444');
+WHERE (ISSNs = '2076-0787') OR (ISSNs = '2077-1444');
```
When you do not know the entire value you are searching for, you can use comparison keywords such as `LIKE`, `IN`, `BETWEEN...AND`, `IS NULL`. For instance, we can use `LIKE` in combination with `WHERE` to search for data that matches a pattern.
diff --git a/04-ordering-commenting.md b/04-ordering-commenting.md
index db0c03fb..3105666b 100644
--- a/04-ordering-commenting.md
+++ b/04-ordering-commenting.md
@@ -56,7 +56,8 @@ Consider the following query:
```sql
SELECT *
FROM articles
-WHERE (ISSNs = '2076-0787') OR (ISSNs = '2077-1444') OR (ISSNs = '2067-2764|2247-6202');
+WHERE (ISSNs = '2076-0787') OR (ISSNs = '2077-1444')
+ OR (ISSNs = '2067-2764|2247-6202');
```
SQL offers the flexibility of iteratively adding new conditions but you may reach a point where the query is difficult to read and inefficient. For instance, we can use `IN` to improve the query and make it more readable:
@@ -68,7 +69,7 @@ WHERE (ISSNs IN ('2076-0787', '2077-1444', '2067-2764|2247-6202'));
```
We started with something simple, then added more clauses one by one, testing
-their effects as we went along. For complex queries, this is a good strategy, to make sure you are getting what you want. Sometimes it might help to take a subset of the data that you can easily see in a temporary database to practice your queries on before working on a larger or more complicated database.
+their effects as we went along. For complex queries, this is a good strategy, to make sure you are getting what you want. Sometimes it might help to take a subset of the data that you can easily see in a temporary database to practice your queries on before working on a larger or more complicated database.
When the queries become more complex, it can be useful to add comments to express to yourself, or to others, what you are doing with your query. Comments help explain the logic of a section and provide context for anyone reading the query. It's essentially a way of making notes within your SQL. In SQL, comments begin using \--
and end at the end of the line. To mark a whole paragraph as a comment, you can enclose it with the characters /\* and \*/. For example, a commented version of the above query can be written as:
@@ -79,7 +80,8 @@ join multiple tables because they represent a good example of using
comments in SQL to explain more complex queries.*/
-- First we mention all the fields we want to display
-SELECT articles.Title, articles.First_Author, journals.Journal_Title, publishers.Publisher
+SELECT articles.Title, articles.First_Author, journals.Journal_Title,
+ publishers.Publisher
-- from the first table
FROM articles
-- and join it with the second table.
@@ -93,11 +95,11 @@ ON publishers.id = journals.PublisherId;
```
To see the introduction and explanation of JOINS, please click to [Episode 6](06-joins-aliases.md).
-{: .sql}
:::::::::::::::::::::::::::::::::::::::: keypoints
- Queries often have the structure: SELECT data FROM table WHERE certain criteria are present.
+- Comments can make our queries easier to read and understand.
::::::::::::::::::::::::::::::::::::::::::::::::::
diff --git a/05-aggregating-calculating.md b/05-aggregating-calculating.md
index cd24b1b7..2b389dad 100644
--- a/05-aggregating-calculating.md
+++ b/05-aggregating-calculating.md
@@ -1,5 +1,5 @@
---
-title: Aggregating & calculating values
+title: Aggregating and calculating values
teaching: 15
exercises: 5
---
@@ -72,7 +72,7 @@ For example, we can adapt the last request we wrote to only return information a
SELECT ISSNs, COUNT(*)
FROM articles
GROUP BY ISSNs
-HAVING count(Title) >= 10;
+HAVING COUNT(Title) >= 10;
```
The `HAVING` keyword works exactly like the `WHERE` keyword, but uses aggregate functions instead of database fields. When you want to filter based on an aggregation like `MAX, MIN, AVG, COUNT, SUM`, use `HAVING`; to filter based on the individual values in a database field, use `WHERE`.
@@ -94,7 +94,7 @@ but only for the journals with 5 or more citations on average.
SELECT ISSNs, AVG(Citation_Count)
FROM articles
GROUP BY ISSNs
-HAVING AVG(Citation_Count)>=5;
+HAVING AVG(Citation_Count) >= 5;
```
:::::::::::::::::::::::::
@@ -106,9 +106,10 @@ HAVING AVG(Citation_Count)>=5;
In SQL, we can also perform calculations as we query the database. Also known as computed columns, we can use expressions on a column or multiple columns to get new values during our query. For example, what if we wanted to calculate a new column called `CoAuthor_Count`:
```sql
-SELECT Title, ISSNs, Author_Count -1 as CoAuthor_Count
+SELECT Title, ISSNs, Author_Count - 1 as CoAuthor_Count
FROM articles
-ORDER BY Author_Count -1 DESC;
+ORDER BY CoAuthor_Count DESC;
+
```
In section [6\. Joins and aliases](06-joins-aliases.md) we are going to learn more about the SQL keyword `AS` and how to make use of aliases - in this example we simply used the calculation and `AS` to represent that the new column is different from the original SQL table data.
diff --git a/06-joins-aliases.md b/06-joins-aliases.md
index 9bd30538..71f302f8 100644
--- a/06-joins-aliases.md
+++ b/06-joins-aliases.md
@@ -54,7 +54,8 @@ We will cover [relational database design](08-database-design.md) in the next ep
When joining tables, you can specify the columns you want by using `table.colname` instead of selecting all the columns using `*`. For example:
```sql
-SELECT articles.ISSNs, journals.Journal_Title, articles.Title, articles.First_Author, articles.Month, articles.Year
+SELECT articles.ISSNs, journals.Journal_Title, articles.Title,
+ articles.First_Author, articles.Month, articles.Year
FROM articles
JOIN journals
ON articles.ISSNs = journals.ISSNs;
@@ -63,7 +64,8 @@ ON articles.ISSNs = journals.ISSNs;
Joins can be combined with sorting, filtering, and aggregation. So, if we wanted the average number of authors for articles on each journal, we can use the following query:
```sql
-SELECT articles.ISSNs, journals.Journal_Title, ROUND(AVG(articles.Author_Count), 2)
+SELECT articles.ISSNs, journals.Journal_Title,
+ ROUND(AVG(articles.Author_Count), 2)
FROM articles
JOIN journals
ON articles.ISSNs = journals.ISSNs
@@ -83,7 +85,7 @@ Write a query that `JOINS` the `articles` and `journals` tables and that returns
## Solution
```sql
-SELECT journals.Journal_Title, count(*), avg(articles.Citation_Count)
+SELECT journals.Journal_Title, COUNT(*), AVG(articles.Citation_Count)
FROM articles
JOIN journals
ON articles.ISSNs = journals.ISSNs
@@ -142,16 +144,18 @@ We can alias both table names:
```sql
SELECT ar.Title, ar.First_Author, jo.Journal_Title
FROM articles AS ar
-JOIN journals AS jo
+JOIN journals AS jo
ON ar.ISSNs = jo.ISSNs;
```
And column names:
```sql
-SELECT ar.title AS title, ar.first_author AS author, jo.journal_title AS journal
+SELECT ar.title AS title,
+ ar.first_author AS author,
+ jo.journal_title AS journal
FROM articles AS ar
-JOIN journals AS jo
+JOIN journals AS jo
ON ar.issns = jo.issns;
```
diff --git a/08-database-design.md b/08-database-design.md
index d544d0f8..fd1c00ee 100644
--- a/08-database-design.md
+++ b/08-database-design.md
@@ -52,11 +52,11 @@ Database design involves a model or plan developed to determine how the data is
## Terminology
-
+![](fig/field-record-value.png){alt='Fields, Records, Values'}
In the [Introduction to SQL](01-introduction.md) lesson, we introduced the terms "fields", "records", and "values". These terms are commonly used in databases while the "columns", "rows", and "cells" terms are more common in spreadsheets. Fields store a single kind of information (text, integers, etc.) related to one topic (title, author, year), while records are a set of fields containing specific values related to one item in your database (a book, a person, a library).
-To design a database, we must first decide what kinds of things we want to represent as tables. A table is the physical manifestation of a kind of "entity". An entity is the conceptual representation of the thing we want to store informtation about in the database, with each row containing information about one entity. An entity has "attributes" that describe it, represented as fields. For example, an article or a journal is an entity. Attributes would be things like the article title, or journal ISSN which would appear as fields.
+To design a database, we must first decide what kinds of things we want to represent as tables. A table is the physical manifestation of a kind of "entity". An entity is the conceptual representation of the thing we want to store information about in the database, with each row containing information about one entity. An entity has "attributes" that describe it, represented as fields. For example, an article or a journal is an entity. Attributes would be things like the article title, or journal ISSN which would appear as fields.
To create relationships between tables later on, it is important to designate one column as a primary key. A primary key, often designated as PK, is one attribute of an entity that distinguishes it from the other entities (or records) in your table. The primary key must be unique for each row for this to work. A common way to create a primary key in a table is to make an 'id' field that contains an auto-generated integer that increases by 1 for each new record. This will ensure that your primary key is unique.
@@ -68,11 +68,11 @@ ERDs are helpful tools for visualising and structuring your data more efficientl
![](https://user-images.githubusercontent.com/30397506/115917162-6cc7ef00-a43b-11eb-97af-16fe50caa6a6.png){alt='Articles Database'}
-*Or you can view the [dbdiagram.io interactive version of the ERD](https://dbdiagram.io/d/5cc32b0cf7c5bb70c72fc530)*
+*Or you can view the [dbdiagram.io interactive version of the ERD](https://dbdiagram.io/d/5cc32b0cf7c5bb70c72fc530).*
Relationships between entities and their attributes are represented by lines linking them together. For example, the line linking journals and publishers is interpreted as follows: The 'journals' entity is related to the 'publishers' entity through the attributes 'PublisherId' and 'id' respectively.
-Conceptually, we know that a journal has only one publisher but a publisher can publish many journals. This is known as a one-to-many relationship. In modeling relationships, we usually assign a unique identifier to the 'one' side of the relationship and use that same identifier to refer to that entity on the 'many' side. In 'publishers' table, the 'id' attribute is that unique identifier. We use that same identifier in the 'journals' table to refer to an individual publisher. That way, there is an unambiguous way for us to distinguish which journals are associated with which publisher in a way that keeps the integrity of the data (see the Normalization section below).
+Conceptually, we know that a journal has only one publisher but a publisher can publish many journals. This is known as a one-to-many relationship. In modeling relationships, we usually assign a unique identifier to the 'one' side of the relationship and use that same identifier to refer to that entity on the 'many' side. In 'publishers' table, the 'id' attribute is that unique identifier. We use that same identifier in the 'journals' table to refer to an individual publisher. That way, there is an unambiguous way for us to distinguish which journals are associated with which publisher in a way that keeps the integrity of the data (see [the Normalisation section](#normalisation) below).
## More Terminology
@@ -95,7 +95,7 @@ ERDs are helpful in normalising your data which is a process that can be used to
In the example ERD above, creating a separate table for publishers and linking to it from the journals table via PK and FK identifiers allows us to normalise the data and avoid inconsistencies. If we used one table, we could introduce publisher name errors such as misspellings or alternate names as demonstrated below.
-![](fig/normalisation.png){alt='Introducting inconsistencies and normalising data'}
+![](fig/normalisation.png){alt='Introducing inconsistencies and normalising data'}
There are a number of normal forms in the normalisation process that can help you reduce redundancy in database tables. [Study Tonight](https://www.studytonight.com/dbms/database-normalization.php) features tutorials where you can learn more about them.
@@ -129,5 +129,3 @@ Additional database design tutorials to consult from Lucidchart:
- Database design is helpful for creating more efficient databases.
::::::::::::::::::::::::::::::::::::::::::::::::::
-
-
diff --git a/09-create.md b/09-create.md
index ffe7da04..a1a28709 100644
--- a/09-create.md
+++ b/09-create.md
@@ -50,15 +50,14 @@ We talked about data types earlier [in Introduction to SQL: SQL Data Type Quick
When we create a table,
we can specify several kinds of constraints on its columns.
-For example,
-a better definition for the `journals` table would be:
+For example, a better definition for the `journals` table would be:
```sql
CREATE TABLE "journals" (
- "id" INTEGER NOT NULL PRIMARY KEY AUTOINCREMENT,
- "ISSN-L" TEXT,
- "ISSNs" TEXT,
- "PublisherId" INTEGER,
+ "id" INTEGER NOT NULL PRIMARY KEY AUTOINCREMENT,
+ "ISSN-L" TEXT,
+ "ISSNs" TEXT,
+ "PublisherId" INTEGER,
"Journal_Title" TEXT,
CONSTRAINT "PublisherId" FOREIGN KEY("PublisherId") REFERENCES "publishers"("id")
);
@@ -76,17 +75,22 @@ we can add, change, and remove records using our other set of commands,
Here is an example of inserting rows into the `journals` table:
```sql
-INSERT INTO "journals" VALUES (1,'2077-0472','2077-0472',2,'Agriculture');
-INSERT INTO "journals" VALUES (2,'2073-4395','2073-4395',2,'Agronomy');
-INSERT INTO "journals" VALUES (3,'2076-2616','2076-2616',2,'Animals');
-
+INSERT INTO "journals"
+VALUES (1,'2077-0472','2077-0472',2,'Agriculture');
+INSERT INTO "journals"
+VALUES (2,'2073-4395','2073-4395',2,'Agronomy');
+INSERT INTO "journals"
+VALUES (3,'2076-2616','2076-2616',2,'Animals');
```
We can also insert values into one table directly from another:
```sql
CREATE TABLE "myjournals"(Journal_Title text, ISSNs text);
-INSERT INTO "myjournals" SELECT Journal_Title, ISSNs FROM journals;
+INSERT INTO "myjournals"
+SELECT Journal_Title, ISSNs
+FROM journals;
+
```
Modifying existing records is done using the `UPDATE` statement.
@@ -98,7 +102,9 @@ For example, if we made a typo when entering the ISSNs
of the last `INSERT` statement above, we can correct it with an update:
```sql
-UPDATE journals SET ISSN-L = 2076-2615, ISSNs = 2076-2615 WHERE id = 3;
+UPDATE journals
+SET ISSN-L = 2076-2615, ISSNs = 2076-2615
+WHERE id = 3;
```
Be careful to not forget the `WHERE` clause or the update statement will
@@ -112,7 +118,8 @@ that matches the records we want to discard.
We can remove the journal `Animals` from the `journals` table like this:
```sql
-DELETE FROM journals WHERE Journal_Title = 'Animals';
+DELETE FROM journals
+WHERE Journal_Title = 'Animals';
```
But now the article `Early Onset of Laying and Bumblefoot Favor Keel Bone Fractures` from the table `articles`
@@ -126,15 +133,17 @@ matching every row `ISSNs`in the table `articles`.
## Exercise
Write an SQL statement to add the journal "New Journal of Physics" (ISSNs \& ISSNs: 1367-2630; publisher: "Institute of Physics (IOP)") to the table
-`journals`. You need to add the publisher "IOP" to the table `publishers` as well.
+`journals`. You need to add the publisher "IOP" to the table `publishers` as well.
::::::::::::::: solution
## Solution
```sql
-INSERT INTO "publishers" VALUES (7,'Institute of Physics (IOP)');
-INSERT INTO "journals" VALUES (52,'1367-2630','1367-2630',7,'New Journal of Physics');
+INSERT INTO "publishers"
+VALUES (7,'Institute of Physics (IOP)');
+INSERT INTO "journals"
+VALUES (52,'1367-2630','1367-2630',7,'New Journal of Physics');
```
:::::::::::::::::::::::::
@@ -158,7 +167,7 @@ stored in any particular order.)
## Solution
-#### Advantages
+### Advantages
- A version control system will be able to show differences between versions
of the dump file; something it can't do for binary files like databases
@@ -167,7 +176,7 @@ stored in any particular order.)
- The version control log will explain the reason for the changes in each version
of the database
-#### Disadvantages
+### Disadvantages
- Artificial differences between commits because records don't have a fixed order
diff --git a/11-extra-challenges.md b/11-extra-challenges.md
index 4d9178a4..160dc9b8 100644
--- a/11-extra-challenges.md
+++ b/11-extra-challenges.md
@@ -35,7 +35,7 @@ How many `articles` are there from each `First_author`? Can you make an alias fo
## Solution 1
```sql
-SELECT First_Author, COUNT( * ) AS n_articles
+SELECT First_Author, COUNT(*) AS n_articles
FROM articles
GROUP BY First_Author
ORDER BY n_articles DESC;
@@ -56,7 +56,7 @@ How many papers have a single author? How many have 2 authors? How many 3? etc?
## Solution 2
```sql
-SELECT Author_Count, COUNT( * )
+SELECT Author_Count, COUNT(*)
FROM articles
GROUP BY Author_Count;
```
@@ -77,10 +77,10 @@ language is unknown.
## Solution 3
```sql
-SELECT Language, COUNT( * )
+SELECT Language, COUNT(*)
FROM articles
JOIN languages
-ON articles.LanguageId=languages.id
+ON articles.LanguageId = languages.id
WHERE Language != ''
GROUP BY Language;
```
@@ -101,10 +101,10 @@ number of citations for that `Licence` type?
## Solution 4
```sql
-SELECT Licence, AVG( Citation_Count ), COUNT( * )
+SELECT Licence, AVG(Citation_Count), COUNT(*)
FROM articles
JOIN licences
-ON articles.LicenceId=licences.id
+ON articles.LicenceId = licences.id
WHERE Licence != ''
GROUP BY Licence;
```
@@ -124,12 +124,13 @@ Write a query that returns `Title, First_Author, Author_Count, Citation_Count, M
## Solution 5
```sql
-SELECT Title, First_Author, Author_Count, Citation_Count, Month, Year, Journal_Title, Publisher
+SELECT Title, First_Author, Author_Count, Citation_Count,
+ Month, Year, Journal_Title, Publisher
FROM articles
JOIN journals
-ON articles.issns=journals.ISSNs
+ON articles.issns = journals.ISSNs
JOIN publishers
-ON publishers.id=journals.PublisherId;
+ON publishers.id = journals.PublisherId;
```
:::::::::::::::::::::::::
diff --git a/Bonus_GoodStyle.md b/Bonus_GoodStyle.md
index 762d5269..ad3752cb 100644
--- a/Bonus_GoodStyle.md
+++ b/Bonus_GoodStyle.md
@@ -40,7 +40,8 @@ SELECT articles.Title, articles.First_Author, journals.Journal_Title, publishers
Into something that looks like this:
```sql
-SELECT articles.Title, articles.First_Author, journals.Journal_Title, publishers.Publisher
+SELECT articles.Title, articles.First_Author, journals.Journal_Title,
+ publishers.Publisher
FROM articles
JOIN journals
ON articles.ISSNs = journals.ISSNs
diff --git a/discuss.md b/discuss.md
index 5a054b89..136bad06 100644
--- a/discuss.md
+++ b/discuss.md
@@ -4,7 +4,6 @@ title: Discussion
There are many ways to discuss Library Carpentry lessons:
-- Join our [Gitter discussion forum](https://gitter.im/LibraryCarpentry/).
- Join our [Slack organisation](https://slack-invite.carpentries.org/) and #libraries channel.
- Stay in touch with our [Topicbox Group](https://carpentries.topicbox.com/groups/discuss-library-carpentry).
- Follow updates on [Twitter](https://twitter.com/LibCarpentry).
diff --git a/learner-profiles.md b/learner-profiles.md
deleted file mode 100644
index 434e335a..00000000
--- a/learner-profiles.md
+++ /dev/null
@@ -1,5 +0,0 @@
----
-title: FIXME
----
-
-This is a placeholder file. Please add content here.
diff --git a/learner_profile.md b/learner_profile.md
index bfb02ec0..692e4a92 100644
--- a/learner_profile.md
+++ b/learner_profile.md
@@ -2,8 +2,6 @@
title: 'Elias'
---
-**Learners' Profiles**
-
Elias is a senior librarian in Utrecht University Library, specialising in manuscripts and special collections.
He has no programming experience but he is familiar with information principles, collection management and several metadata formats.
diff --git a/md5sum.txt b/md5sum.txt
index 91be673f..b10423fb 100644
--- a/md5sum.txt
+++ b/md5sum.txt
@@ -3,21 +3,20 @@
"LICENSE.md" "b24ebbb41b14ca25cf6b8216dda83e5f" "site/built/LICENSE.md" "2023-05-02"
"config.yaml" "5b9010a1be28b0bbe9103826d33ad762" "site/built/config.yaml" "2023-05-02"
"index.md" "e3b06a792f01db253ba3104fb358bf67" "site/built/index.md" "2023-09-21"
-"episodes/01-introduction.md" "cf1fc4a4002ecb50dbbe0dbef22131ca" "site/built/01-introduction.md" "2024-03-20"
-"episodes/02-selecting-sorting-data.md" "379b665682f5ce5d34027967214d50a2" "site/built/02-selecting-sorting-data.md" "2024-03-20"
-"episodes/03-filtering.md" "f8695e2befa7d714caea9c833c48f1e6" "site/built/03-filtering.md" "2024-03-20"
-"episodes/04-ordering-commenting.md" "2818b322e7cec000c8dd2380bbbd2cbb" "site/built/04-ordering-commenting.md" "2024-03-20"
-"episodes/05-aggregating-calculating.md" "a02ee7c28fd11b7d658afa825e98ad2d" "site/built/05-aggregating-calculating.md" "2024-03-20"
-"episodes/06-joins-aliases.md" "94c30bab2ff9761ef57e942d76e7231b" "site/built/06-joins-aliases.md" "2024-03-20"
+"episodes/01-introduction.md" "fb8a78b62c4a105d83159f1c760c3296" "site/built/01-introduction.md" "2024-03-26"
+"episodes/02-selecting-sorting-data.md" "d6ff7f7e49af4896c5c260d75bab8dc2" "site/built/02-selecting-sorting-data.md" "2024-03-26"
+"episodes/03-filtering.md" "66d25fee1c1e5b80ac26617b4e0136aa" "site/built/03-filtering.md" "2024-03-26"
+"episodes/04-ordering-commenting.md" "910ea0a408ba6477b70c5b60d70ffbeb" "site/built/04-ordering-commenting.md" "2024-03-26"
+"episodes/05-aggregating-calculating.md" "19832b24150b294c381b9f7f5b5b6e61" "site/built/05-aggregating-calculating.md" "2024-03-26"
+"episodes/06-joins-aliases.md" "ff5a2ae6edbdb00520f5d3d4d5ba8110" "site/built/06-joins-aliases.md" "2024-03-26"
"episodes/07-saving-queries.md" "996f14166e6ec81d2acc6b59975fb20f" "site/built/07-saving-queries.md" "2023-05-02"
-"episodes/08-database-design.md" "e15a8e2a508e0493d8c79c68e6a298fa" "site/built/08-database-design.md" "2024-03-20"
-"episodes/09-create.md" "cbc27188d4cb5cab949887bc887a0be2" "site/built/09-create.md" "2024-03-20"
+"episodes/08-database-design.md" "a2fb2fcb34b34e0e57b75de666bc2027" "site/built/08-database-design.md" "2024-03-26"
+"episodes/09-create.md" "b1d9d06757b8f08c9ab094ac671feb69" "site/built/09-create.md" "2024-03-26"
"episodes/10-other-tools.md" "76e7d5b8c85b30db01d318e6cb57688d" "site/built/10-other-tools.md" "2023-05-02"
-"episodes/11-extra-challenges.md" "4c0f71750b1fcdb17d9a712bd1c4ea28" "site/built/11-extra-challenges.md" "2024-03-20"
-"episodes/Bonus_GoodStyle.md" "e8544f1f61bf51712b7ef36007ddb2e7" "site/built/Bonus_GoodStyle.md" "2024-03-20"
+"episodes/11-extra-challenges.md" "36e5b6254a17364519679b7361348c64" "site/built/11-extra-challenges.md" "2024-03-26"
+"episodes/Bonus_GoodStyle.md" "d6a13fc715c5daea41ddd8dfb341859e" "site/built/Bonus_GoodStyle.md" "2024-03-26"
"instructors/instructor-notes.md" "27af90efce1615e70ef091809f06236b" "site/built/instructor-notes.md" "2023-05-02"
-"learners/discuss.md" "9e47eba4de5b1fa887013407bb248fee" "site/built/discuss.md" "2024-03-20"
-"learners/reference.md" "12a03e8e57f53890db3fbe49a54f1296" "site/built/reference.md" "2024-03-20"
-"learners/setup.md" "5e4bde30d1943c3b3c4740bd9354b54d" "site/built/setup.md" "2024-03-20"
-"profiles/learner-profiles.md" "60b93493cf1da06dfd63255d73854461" "site/built/learner-profiles.md" "2024-03-20"
-"profiles/learner_profile.md" "2301a3d5aca3894b8e4f992f688e8e75" "site/built/learner_profile.md" "2024-03-20"
+"learners/discuss.md" "fa2c97ac5aa1cc496c7ba1c9c5f2efd4" "site/built/discuss.md" "2024-03-26"
+"learners/reference.md" "d78741e078e2063227c909814e63135d" "site/built/reference.md" "2024-03-26"
+"learners/setup.md" "c502295e0a7bb41286d4eea0cdbfee14" "site/built/setup.md" "2024-03-26"
+"profiles/learner_profile.md" "77db4fa8b0fa73725da25fca12a62b85" "site/built/learner_profile.md" "2024-03-26"
diff --git a/reference.md b/reference.md
index 9297f0c2..33ebbe72 100644
--- a/reference.md
+++ b/reference.md
@@ -2,10 +2,6 @@
title: 'SQL Cheat Sheet'
---
-## SQL Cheat Sheet
-
-***
-
### Basic query
```sql
@@ -98,7 +94,7 @@ ON table_name1.column_name = table_name2.column_name;
```
- Combine data from two tables where the values of column\_name in the two tables are the same.
-- Instead of `ON`, you can use the `USING` keyword as a shorthand. E.g. `USING (coolumn_name)`.
+- Instead of `ON`, you can use the `USING` keyword as a shorthand. E.g. `USING (column_name)`.
```sql
SELECT alias1.column_name1, alias1.column_name2, alias2.column_name3
diff --git a/setup.md b/setup.md
index a16dc319..e7137ced 100644
--- a/setup.md
+++ b/setup.md
@@ -16,7 +16,7 @@ To import data, you'll need to open DB Browser for SQLite and download a zip fil
1. Download the data files doaj-article-sample.zip from
[Zenodo](https://doi.org/10.5281/zenodo.8360812).
-2. Open the zip file with the zip utlity on your machine and save the folder and files to a location where you can easily find them. For example, your Desktop.
+2. Open the zip file with the zip utility on your machine and save the folder and files to a location where you can easily find them. For example, your Desktop.
3. Contained in the zip file are two files, doaj-article-sample.db and doaj-article-sample.db.sql. You can either open the database file (less steps) or import the SQL file (more steps).
### Open the database file
@@ -59,7 +59,7 @@ To exit type `exit`.
#### Windows
-On Windows download the [Windows Installer](https://github.com/swcarpentry/windows-installer/releases/download/v0.3/SWCarpentryInstaller.exe)
+On Windows download the [Windows Installer](https://github.com/swcarpentry/windows-installer/releases/download/v0.3/SWCarpentryInstaller.exe).
Copy the file to a directory and open the directory using the windows command line. Type `sqlite3`.
For a more detailed explanation see this [tutorial](https://www.sqlitetutorial.net/download-install-sqlite/).